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ABSTRACT 


We present a novel, model-independent framework for studying the architecture of an exoplanetary system at the system level. This 
framework allows us to characterise, quantify, and classify the architecture of an individual planetary system. Our aim in this en- 
deavour is to generate a uniform systematic method to study the arrangement and distribution of various planetary quantities within a 
single planetary system. We propose that the space of planetary system architectures be partitioned into four classes: similar, mixed, 
anti-ordered, and ordered. A central aim of this paper is to introduce these four architecture classes. We applied our framework to 
observed and synthetic multi-planetary systems, thereby studying their architectures of mass, radius, density, core mass, and the core 
water mass fraction. We explored the relationships between a system's (mass) architecture and other properties. Our work suggests 
that: (a) similar architectures are the most common outcome of planet formation; (b) internal structure and composition of planets 
shows a strong link with their system architecture; (c) most systems inherit their mass architecture from their core mass architecture; 
(d) most planets that started inside the ice line and formed in-situ are found in systems with a similar architecture; and (e) most 
anti-ordered systems are expected to be rich in wet planets, while most observed mass ordered systems are expected to have many dry 
planets. We find, in good agreement with theory, that observations are generally biased towards the discovery of systems whose den- 
sity architectures are similar, mixed, or anti-ordered. This study probes novel questions and new parameter spaces for understanding 
theory and observations. Future studies may utilise our framework to not only constrain the knowledge of individual planets, but also 
the multi-faceted architecture of an entire planetary system. We also speculate on the role of system architectures in hosting habitable 
worlds. 
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U Over the last 25 years, our knowledge of exoplanetary astro- 
physics has improved dramatically. While the first decade was 

S marked by sensational discoveries of individual exoplanets (e.g. 
» Vidal-Madjar et al. 2003; Santos et al. 2004; Bouchy et al. 2005; 
=" Udry et al. 2007; Kalas et al. 2008; Charbonneau et al. 2009; 
Snellen et al. 2010), we are now in an age of population-level ex- 
oplanetary statistics (for a recent review, see Zhu & Dong 2021). 

N We now know that (statistically) almost every star hosts a planet 


ee 


> and one in two Solar-like stars host a rocky planet in their habit- 

"4 able zone (Hsu et al. 2019; Bryson et al. 2021). Moreover, many 

"X exoplanet-hosting stars have multiple planets orbiting them. 

d The arrangement of multiple planets and the collective dis- 
tribution of their physical properties around host star(s) char- 
acterises the architecture of a planetary system (Mishra et al. 
2021). Exoplanets in some multi-planetary systems are thought 
to behave like ‘peas in a pod’ (Lissauer et al. 2011; Ciardi et al. 
2013; Millholland et al. 2017; Weiss et al. 2018). The peas in a 
pod trend consists of the following correlations: size, whereby 
adjacent exoplanets are either similar or ordered in size (i.e. the 
outer planet is larger); mass, whereby adjacent exoplanets are ei- 


* Catalogue of observed planetary systems used in this work is avail- 
able online at https: //cdsarc.cds.unistra.fr/cgi-bin/qcat? 
J/A*A/. 


ther similar or ordered in mass; spacing, whereby for a system 
with three or more planets, the spacing between an adjacent pair 
of exoplanets is similar to the spacing between the next consec- 
utive pair; packing, whereby smaller planets tend to be packed 
together closely and larger planets are in wider orbital configu- 
rations. 


While the statistical method used by Weiss et al. (2018) has 
been debated (Zhu 2020; Murchikova & Tremaine 2020; Weiss 
& Petigura 2020), support for the astrophysical nature of the 
peas in a pod correlations (as opposed to emerging from detec- 
tion biases) has emerged from theoretical studies and numerical 
simulations (Adams 2019; Adams et al. 2020; He et al. 2019; 
He et al. 2021; Mulders et al. 2020). In particular, Mishra et al. 
(2021) reproduced the observations from Weiss et al. (2018) us- 
ing a model of planet formation and evolution (the Bern Model 
Emsenhuber et al. (2021a,b)) and a model for the detection bi- 
ases of a Kepler-like transit survey (using KOBE). We showed 
that when nature's underlying exoplanetary population (consist- 
ing of detected and undetected exoplanets) resembles peas in a 
pod, then a population of transiting exoplanets will have correla- 
tions that are consistent with those found by Weiss et al. (2018). 
In addition, Mishra et al. (2021) suggested that the four trends 
are not independent of each other. The size correlations seem to 
emerge from the mass correlations, while the mass and packing 
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trends could combine to give rise to the spacing trend. The peas 
in a pod trends are amenable to a unification. 

Most of the current studies on this topic utilise statistical 
correlation coefficients at the population level, that is, the cor- 
relation is measured for adjacent planetary pairs from several 
planetary systems. While useful in terms of testing the existence 
(or otherwise) of architecture trends, these coefficients may have 
limited utility for analysing the architecture of a single planetary 
system. Being statistical in nature, a reliable estimate of these 
coefficients requires large datasets - which seems difficult for a 
single system. Although there are some planetary system-level 
studies (Kipping 2018; Alibert 2019; Mishra et al. 2019; Gilbert 
& Fabrycky 2020; Bashi & Zucker 2021, discussed in Sect. 3.1), 
the current literature lacks a prescription for uniformly assessing 
the multi-faceted architectures of several quantities (e.g. mass 
architecture, radius architecture, or eccentricity architecture) for 
a single planetary system. 

We seek a framework that allows us to characterise the ar- 
chitecture of an individual planetary system. Our motivations 
for developing such a framework arise from questions related 
to: formation, such as the extent to which a system’s architec- 
ture is shaped by initial conditions (i.e. the environment in and 
around the star and protoplanetary disk formation regions) (Jin 
& Li 2014; Safsten et al. 2020); evolution, the role of physical 
processes such as orbital migration or giant impacts in shaping 
the final architecture of planetary systems (Mulders et al. 2020); 
identification, which particular stars host planets that resemble 
peas in a pod, and, in particular, whether the planets in systems 
like TOI-178 (Leleu et al. 2021), Trappist-1 (Agol et al. 2021), 
or 55 Cancri (Bourrier et al. 2018) show mass/size similarities; 
other architectures, we know that there are many planetary sys- 
tems that do not follow the peas in a pod architecture (e.g. the 
Solar System). Overall, it is not obvious how the architecture of 
any individual planetary system should be uniformly assessed. 

In this series of papers, we propose a framework for examin- 
ing the architecture of planetary systems at the system level. The 
philosophy behind system level analysis is to consider the en- 
tire planetary system as a single unit of a physical system. This 
framework allows us to not only quantify, compare, and inves- 
tigate a system's architecture, but also offers some unexpected 
benefits. As it turns out, the framework allows for a conceptu- 
ally intuitive partitioning of the space of possible architectures. 
We label the four classes of planetary system architectures as: 
similar, ordered, anti-ordered, and mixed. In this way, our work 
extends the trends initiated by the notion of peas in a pod ar- 
chitecture. Furthermore, we verify the unification of the peas 
in a pod correlations proposed in Mishra et al. (2021). We find 
that, Similar architectures are the most common type of plane- 
tary system architectures and their high occurrence explains why 
the intra-system radius uniformity was already observable from 
the first four months of Kepler data (Lissauer et al. 2011). 

Our framework engenders novel questions. For instance, 
if nature produces distinct classes of architecture in multi- 
planetary systems, then what is the frequency or occurrence rates 
of these architecture classes? How does the occurrence of an ar- 
chitecture class depend on stellar and protoplanetary disk en- 
vironment? How does the architecture of a system evolve over 
time? What is the role of stellar evolution, protoplanetary disk 
interactions, and planet formation in shaping the final architec- 
ture? How is a planet's internal composition related to the sys- 
tem's architecture? Or does the ability of a planet to host life 
depends on the architecture of the planetary system? In this se- 
ries of papers, we explore these questions. Although the num- 
ber of multi-planetary systems is low today, this may change 
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in the next few decades. Thanks to large survey missions such 
as PLATO (Rauer et al. 2014), GAIA (Gaia Collaboration et al. 
2016), TESS (Ricker et al. 2015), LIFE (Quanz et al. 2022), and 
others, the growing number of known multi-planetary systems 
will allow for a better understanding to emerge. We hope our 
work encourages observers to dedicate more observation time 
to detecting planets within a known planetary system, that is, in 
finding multi-planetary systems. 

The architecture classification scheme proposed in this pa- 
per is a model-independent framework. To demonstrate our clas- 
sification framework and explore its consequences, we applied 
our framework to simulated planetary systems. To illustrate our 
framework on real systems, we also applied our framework to 
observed exoplanetary systems. We emphasise that while the re- 
sults emerging from the application of our framework on these 
datasets may suffer from some limitations (arising from theoret- 
ical modelling or detection biases for observed systems); how- 
ever, the concept of our architecture classification scheme, being 
model-independent, does not share these limitations. In this pa- 
per, we present the catalogues of planetary systems we apply our 
framework to in Sect. 2, along with a newly curated catalogue of 
observed exoplanetary systems and simulated planetary systems, 
using the Bern Model. We introduce our framework in Sect. 3. 
In Sect. 4, the characteristics of the architecture classes are dis- 
cussed. We explore the link between the internal composition 
of planets and the system architecture class in Sect. 5. Then, in 
Sect. 6, we speculate on how habitability could depend on the 
architecture of planetary systems. Our conclusions are given in 
Sect. 7. 

In a companion paper, we investigate the formation path- 
ways, i.e. the role of initial conditions and physical processes 
in shaping the final architecture (Mishra et al. (2023) referred to 
as Paper II). Our work demonstrates that the processes of planet 
formation and evolution are imprinted on the entire system-level 
architecture. We find that protoplanetary disks with low solid- 
mass give rise to planetary systems endowed with a mass similar- 
ity. On the other hand, massive disks and high metallicity often 
lead to mass Ordered, Anti-Ordered, or Mixed system architec- 
tures. Planet-planet and planet-disk interactions play a decisive 
role in shaping these three architectures. 


2. Catalogues 
2.1. Theoretical dataset: Bern Model 


In this series of works, we demonstrate our architecture frame- 
work by analysing the architecture of synthetic planetary sys- 
tems. These systems were numerically computed using the Gen- 
eration III Bern Model of planet formation and evolution (Em- 
senhuber et al. 2021a,b) that is based on the core-accretion 
paradigm of planet formation (Pollack et al. 1996; Alibert et al. 
2004, 2005). The model follows the growth of protoplanetary 
embryos embedded in a protoplanetary disk of gas and solids 
around a solar-type star. A diverse range of physical processes 
are simultaneously occurring and coherently computed in this 
1D star-disk-embryo system. These include: stellar and disk 
physics (evolution of and interaction between star and viscous 
disk, condensation of volatile and refractory species, etc.), plane- 
tary formation physics (accretion of planetesimals and gases, in- 
ternal structure calculations, etc.), and additional physics (orbital 
and tidal migration, planet-planet N-body interactions, planet- 
disk interactions, atmospheric escape, deuterium fusion, etc.). 
We describe these physical processes in Sect. A and a descriptive 
summary of these processes is provided in Mishra et al. (2021, 
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Fig. 1. Mass-distance diagram. This figure shows the masses and the 
distances of planets in all catalogues used in this study. Shaded regions 
show the parameter space spanned by synthetic planets observed via 
radial velocity surveys (Bern RV Multis), transit surveys (Bern KOBE 
Multis), and ongoing missions (Bern Compact Multis). The parameter 
space for Bern KOBE Multis has been mapped from its original radius- 
period plane. 


in particular, Fig. 1 and Sections 2, 3, and Appendix A). More 
details can also be found in (Emsenhuber et al. 2021a,b). 

We synthesised 1000 planetary systems, each starting with 
100 lunar mass protoplanetary embryos, wherein the following 
initial conditions were varied: mass of protoplanetary gas disk, 
photo-evaporation rate, dust-to-gas ratio, disk inner edge, and 
the starting location of embryos. In Fig. 1, we show all synthetic 
planets on the mass-distance diagram. For each synthetic plane- 
tary system failed embryos, objects with mass less than 0.1 Ms, 
were removed from further analysis!. 

Three observationally motivated catalogues were prepared 
from the synthetic dataset. This allowed us to facilitate a compar- 
ison of the architecture from observed planetary systems with the 
synthetic planetary systems and to make predictions. The param- 
eter space spanned by the planets in these catalogues is shown in 
Fig. 1. These catalogues are as follows: 

Bern RV Multis: We assume a radial velocity (RV) survey 
which can find planets with periods < 15 yr and semi-amplitude 
Kry = 20 cm/s. These numbers are motivated by (a) long- 
running RV surveys such as the HARPS survey (Mayor et al. 
2003, 2011) and the California Legacy Survey (Rosenthal et al. 
2021b; Fulton et al. 2021); (b) current precision achieved by 
ESPRESSO (Lillo-Box et al. 2021; Netto et al. 2021); and (c) 
making predictions for future RV surveys. Such RV detectable 
synthetic planetary systems with four or more planets form the 


! As long as the mass threshold for failed embryos is kept under 0.1 Me, 
the results presented in this paper are not sensitive to the threshold limit. 
We removed these small objects since they (a) failed to grow as massive 
planets, (b) are insignificant to the dynamical evolution of the system, 
and (c) are currently unobservable in exoplanetary systems. All results 
arising from the Bern RV Multis, Bern KOBE Multis, and Bern Com- 
pact Multis are insensitive to these failed embryos. 


Bern RV Multis catalogue, which includes 3 828 planets around 
565 stars. 

Bern KOBE Multis: We assume a Kepler-like transit survey 
which continuously observes 2 x 10? stars for 3.5 yr (Thompson 
et al. 2018). A planet which transits three or more times and pro- 
duces a transit S/N of 7.1 or more is considered detectable. The 
reliability and completeness of such a survey is replicated and 
those synthetic planets which would have been vetted as “plan- 
etary candidates' by the Kepler Robovetter (Thompson et al. 
2018), are kept. Such transiting synthetic planetary systems with 
four or more planets form the Bern KOBE Multis catalogue. 
KOBE was developed and introduced in Mishra et al. (2021). 
There are 6 715 planets around 1283 stars in this catalogue. 

Bern Compact Multis: Ongoing transit missions such as 
CHEOPS and TESS have been successful in characterising com- 
pact multi-planetary systems, such as TOI-178 (Leleu et al. 
2021) and TOI-561 (Lacedelli et al. 2021). Inspired by these dis- 
coveries, we investigated the architecture of compact planetary 
systems simulated by the Bern Model. Our aim is to understand 
the architecture and make predictions for such systems based on 
the core-accretion paradigm (Pollack et al. 1996; Alibert et al. 
2004, 2005). All planets with periods of x 100 d and masses of 
> 0.1 Mg were retained. Synthetic planetary systems, in this pa- 
rameter space, with four or more planets form the Bern Compact 
Multis catalogue, with 2 412 planets around 400 stars included. 


2.2. Observational dataset: A new catalogue 


To demonstrate our framework on observed exoplanetary sys- 
tems, we have curated a new catalogue of known multi-planetary 
systems’. A salient feature of this catalogue (and the philosophy 
behind this work) is its focus on considering planetary systems 
as a single unit of a physical system. Unlike focussing on in- 
dividual exoplanets or a single detection technique, our aim is 
to study the planetary system as a whole. There are two serious 
challenges to this endeavour. Firstly, the biases present in de- 
tection methods tend to prevent a complete, reliable picture of 
an exoplanetary system from emerging (either via undetected or 
mischaracterised planets). Secondly, detecting planets on long 
orbital periods requires long-term, repeated observations, which 
is considerably challenging. We hope that upcoming missions 
and future surveys can mitigate these difficulties. 

We included a planetary system in our catalogue if: (a) it has 
at least four known planets and (b) masses are available for at 
least four planets. For example, Kepler-33, a five planet system, 
is included because mass measurements are available for four 
of its planets?. The criterion of requiring minimum four plan- 
ets emerges due to (a) the requirement for enough planets for 
adequately characterising the architecture and (b) because for 
systems with lower number of planets, it is perhaps difficult to 
uniformly assess whether the low multiplicity is an outcome of 
natural processes or detection biases. To keep the comparison be- 
tween observations and theory uniform, all catalogues in this se- 
ries of works only consider planetary systems with four or more 
planets. The architecture framework can, however, handle two- 
or three-planet systems as well. To make this catalogue useful 
to the wider community and enable future studies, we gathered 
several key stellar and exoplanetary properties. For host stars, we 
report the mass, radius, luminosity, effective temperature, metal- 
licity, age, and distance, along with their identification numbers 


? The catalogue was last updated in April 2021. 
3 For this study, the distinction between mass and minimum mass is 
ignored. 
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Table 1. Observed multi-planetary systems: There are 41 planetary systems with 194 planets in this catalogue. Only the first five rows are shown 
here. The entire table is available online. Online version includes additional identification columns: KIC ID, TIC ID, and GAIA ID. Missing 
information is indicated by ‘~’. References for individual systems are given in appendix Sect. B. 


Stellar parameters 


Hostname Multiplicity ^ M. [Mo] R, [Ro] Ly [Lo] Teg[K] [Fe/H] Age [Gyr] Distance [pc] 
Sun 8 1 1 1 5,772 0 04.6+ 0.1 0 
Trappist-1 7 0.1: 0.002 0.1 0.001 5.53e—- 04 2,566+ 026 +0.04+ 0.08 07.64 2.2 012.0 
TOI-178 6 0.7 + 0.03 0.7 € 0.01 0.14 01.08 4,316+ 070 -0.23+ 005 07.14 6.1 0627 
HD 10180 6 1.1 + 0.05 1.1+ 0.04 15+ 00.00 5,911+ 019 +0.08+ 0.01 043205 039.0 
HD 219134 6 0.8 + 0.03 0.8+ 0.005 0.3+ 00.01 4,700+ 020 +0.11+ 0.04 1104+ 2.2 006.5 
Planetary parameters 
Hostname Planet My[Ms] Ry[Re] ap[AU] e i[^] min. Mp 
(n ve: = 00.055 + 00.000, 0.383 + 0.000, 0.387 + -, 0.21 € -, 7.00 € -, 
Sun ( itt 00.815 + 00.000, 0.949 + 0.000, 0.723 + -, 0.01 + -, 3.39 € -, (EEEF...) 
k d, 01. 374- + - 00. 069, 1. 1164 + -Q. 014, 0.012 + 0.000, 0.01 x 0.00, 89.73 + 0.17, 
Trappist-1 f pow * 01.308 + 00.056, 1.097 + 0.014, 0.016 + 0.000, 0.01 + 0.00, 89.78 + 0.12, (REEF...) 
t T 01.5002 00.440.) (1.152 + 0.073,) (0.026 + 0.001, Ede 88.80 + 1.30, 
TOI-178 a E 04.770 + 00.680, 1.669 + 0.114, 0.037 x 0.001, -+-, 88.40 + 1.60, (REEF...) 
" 13.222 + 00.445, 0.064+0.001,) — (0.07 + 0.03, Lakes 
HD 10180 i 12.014 + 00.699, -t-, 0.129 + 0.002, 0.13 + 0.05, -+-, (REEL ) 
04.620 + 00.140,) (1.544 + 0.059, 0.039 + -, 0.00+-, ) (85.19 + 0.13, 
HD 219134 04.230 + 00.200, 1.458 + 0.048, 0.065 + -, 0.06 + 0.04, 87.38 + 0.10, (FETT...) 


(when available) in the Kepler Input Catalogue (KIC), TESS In- 
put Catalogue (TIC), and GAIA ID. For planets, we report mass 
or minimum mass, radius, semi-major axis, eccentricity, and in- 
clination. In a conservative approach, errors (reported when pos- 
sible) are the maximum of the upper and lower error bounds 
available in the literature. When multiple publications reported 
planetary parameters, a more recent publication was preferred. 
When a single publication reported parameters for all planets in 
a system, then such a consistent set of solution was given pref- 
erence (e.g. GJ 676 A or Kepler-11). For stellar parameters, if 
a star was included in KIC, then the values from Berger et al. 
(2020) are reported. Most other stellar parameters come from 
the TIC (Stassun et al. 2019) or from individual publications. 


There are 41 planetary systems that meet our criteria and de- 
fine our multi-planetary system catalogue (Table 1). With a total 
of 194 planets in our catalogue, the number of planetary systems 
with four, five, six, seven, and eight planets is 24, 7, 8, 1, and 1. 
In this paper, we present the observed planetary systems as they 
are known today and we do not correct the observations for any 
detection biases. Instead, to assist in making comparisons with 
the theory, detection biases will be placed on simulated plane- 
tary systems (Sect. 2.1). Figure 1 shows the mass of observed 
exoplanets as a function of their semi-major axis. 


While our observed multi-planetary systems catalogue en- 
genders system-level studies, its current form poses several tech- 
nical difficulties. Foremost, the number of observations is only 
forty-one. Secondly, multiple detection methods, such as radial 
velocity or transits (etc.) were employed to observe these plan- 
etary systems. Each observation technique suffers from certain 
limitations and detection biases. This implies that the observed 
systems in our catalogue do not constitute a homogeneous and 
complete set of observations. These two limitations of the ob- 
servations catalogue prohibit us from deducing any statistically 
strong result. Nevertheless, we used the observed systems for 
(a) exemplifying system-level approach to real planetary systems 
and (b) using our framework on observations to explore trends 
in the architecture of observed systems. 


Article number, page 4 of 28 


Our results from the observed catalogue may be affected by 
another source of difficulty. There are two systems in our cata- 
logue that host some planets without known mass measurements 
(Kepler-33 b and Kepler-80 f and g). Since these two systems 
have at least four planets with known masses, they have been 
included in our study. However, this does not impact the results 
of the present study in a drastic way. All three planets in these 
systems without mass measurements are either the innermost 
and/or the outermost planets in their respective systems. There- 
fore, the missing measurements do not have a strong influence 
on the characterisable mass architecture. The missing measure- 
ment may have a strong effect if any planet with unknown mass 
was in between two planets with known masses. 


3. Characterizing architecture: A new framework 
3.1. Literature review 


We review some approaches from other studies that have tried to 
capture planetary system-level properties in this section. Kipping 
(2018) investigated similarity and ordering (of planetary sizes) at 
the level of an individual system. Using an entropy based frame- 
work on Kepler systems, he concludes that initial conditions 
are inferable from the present-day architecture. As we go on to 
show in this series, our work not only supports this conclusion, 
but additionally demonstrates the possible links between initial 
conditions and final architecture. Although the above-mentioned 
study considers a similar problem to the one we deal with here, 
our frameworks differ considerably. Built on step-functions and 
combinatorics, the aforementioned framework does not take into 
account the magnitude of variation. 

Alibert (2019) proposed a concept of distance between two 
planetary systems. The Alibert distance captures inter-system 
differences, whereas our framework quantifies intra-system sim- 
ilarities. The Alibert distance is useful to quantify the similarity 
(or dissimilarity) between two planetary systems and in unsuper- 
vised machine-learning algorithms to find clusters in the space 
of planetary systems. Bashi & Zucker (2021) recently proposed 
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another concept for distance based on a statistical distance. The 
‘weighted’ energy distance is the distance between two plane- 
tary systems, with each planet represented on the log-period and 
log-radius plane, utilising planetary masses (from a mass-radius 
relationship) as weights. As with the Alibert distance, the Bashi- 
Zucker distance requires two planetary systems and thus it is 
not suitable for characterising the global architecture for a single 
planetary system. 

Gilbert & Fabrycky (2020) proposed seven parameters for 
quantifying the global structure of planetary systems: dynam- 
ical mass (ratio of mass in planets to stellar mass), mass par- 
titioning (normalised mass disequilibrium), mass monotonicity 
(weighted Spearman correlation coefficient), characteristic spac- 
ing (average mutual Hill radii), gap complexity, flatness, and 
multiplicity (n). Of these measurements, mass partitioning and 
mass monotonicity have close parallels with our framework. The 
input information required to compute mass partitioning, and 
monotonicity is exactly the same as the input information for 
our architecture framework, namely, a set of planetary masses. 
However, we find that the output displays a curious mix of con- 
cepts. 

Mass partitioning is zero for a system in which all planets 
have the same mass. When one planet has some mass and all 
other planets have negligible mass, the mass partitioning for this 
system is unity. While this parameter captures the two extreme 
cases, it is difficult to interpret and employ this measure in cases 
other than these two extremes. Behaving similarly to a correla- 
tion coefficient, mass monotonicity has a range of [-1,1]. It is de- 
fined as the Spearman correlation coefficient (between mass and 
distance) multiplied by the mass partitioning (which is weighted 
by n^!). Although the work of Gilbert & Fabrycky (2020) stud- 
ies the architecture of planetary systems at the system-level, we 
seek a framework which can also be used with planetary prop- 
erties other than mass, such as radius, bulk density, water mass 
fraction, eccentricities, and so on. 

Millholland et al. (2017) and Wang (2017) showed that the 
peas in a pod pattern reported by Ciardi et al. (2013); Weiss 
et al. (2018) also extends to planetary masses. Millholland et al. 
(2017), using planetary masses derived from transit-timing vari- 
ations, studied the clustering of planets in the mass-radius plane 
and found that the sum of distances (in the log mass-size space) 
between adjacent planets of real systems is much smaller than 
a bootstrapped randomised population. Based on a set of 29 RV 
observed systems, Wang (2017) infer two types of planetary sys- 
tems. Planetary systems with masses of < 30Mo show intra- 
system mass uniformity, while systems with masses = 100M, 
do not follow the peas in a pod pattern — indicating that there are 
only two possibilities for the architecture structure. As we show 
in this series of works, their hypothesis of only two architecture 
types is too simple and cannot capture the richness of physics. 


3.2. Concept 


With our framework, we initially aimed to capture the key aspect 
about the peas in a pod architecture trends. These trends are cor- 
relations between adjacent planets or between consecutive pairs 
of adjacent planets. We want to capture these ideas at the level 
of a single planetary system through a unified framework. We 
do this by studying how a quantity, g;, (such as mass, size, or 
period ratio) varies for all planets within a system. Here, i in- 
dexes the planets within a system. For all quantities, we adopt 
an ‘inside-out’ convention, namely, we start with the innermost 
planet (gj=1) and go to the next adjacent planet (g;-2), and so 
on. By comparing how q; varies for each planet inside-out, we 
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Distance from star 


Fig. 2. Classes of architecture. This schematic diagram shows the four 
architecture classes: similar, anti-ordered, mixed, and ordered. Depend- 
ing on how a quantity (e.g. mass or size) varies from one planet to an- 
other, the architecture of a system can be identified. 


are actually estimating how q; varies with distance from the host 
star. 


In comparing a quantity, q;, with distance, four kind of vari- 
ations emerge. In one scenario, a quantity could show little to no 
variation. In another case, the value of a quantity may increase 
with increasing distance or, conversely, the quantity could de- 
crease from one planet to another. Finally, it is also possible for 
a quantity to not have any clear variations from one planet to 
another. We identify these four scenarios as the four classes of 
architectures that can exist at the level of a single planetary sys- 
tem. This idea is depicted in Fig. 2. 


Mishra et al. (2021) suggested that the mass correlations 
could originate from planet-formation physics and the correla- 
tions of size and spacing could be derivative. Therefore, we first 
apply our framework using planetary masses (except in Sects. 
5 and 6). As depicted in Fig. 2, when the masses of all planets 
within a system are similar to each other, we label the architec- 
ture of such systems as 'similar'. This architecture class corre- 
sponds to the peas in a pod architecture reported in observations 
(Weiss et al. 2018; Millholland et al. 2017). When the masses of 
planets tend to increase inside-out, the architecture of such sys- 
tems is labelled *ordered'. If the planetary mass tends to decrease 
from the inner planet to the outer, we label the architecture of 
these systems as 'anti-ordered'. Finally, if a large increasing and 
decreasing variation in the planetary masses is present, we label 
the architecture of such systems as *mixed'. The mixed architec- 
ture class is also useful in capturing all other architecture pat- 
terns which do not fall under the other three architecture classes. 
Kipping (2018), for example, has analysed some interesting re- 
peating patterns. By introducing these architecture classes, our 
framework organises the possibilities for system architecture. 
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One might wonder, at this point, why introduce such a con- 
cept and the ensuing mathematical machinery? While part of this 
work began as an inspired exploration to categorise our under- 
standing of system architecture, it turns out that there are good 
physical reasons to pursue this process. As is shown in this and 
a companion paper, planetary systems that have the same archi- 
tecture tend to have a host of other properties in common, such 
as internal structures (core-mass, ice-mass) distributions. Most 
importantly, systems with a common architecture tend to have 
same formation pathways, initial conditions, and evolutionary 
histories. Practically, this means that a quick glance at a system’s 
architecture may reveal a lot more about its formation scenario. 

Our architecture classification framework utilises two quan- 
tities — the coefficient of similarity and the coefficient of vari- 
ation, introduced in Sects. 3.3 and 3.4, respectively. These two 
coefficients allow us to quantify the conceptual ideas we have 
presented above. Together, these coefficients define a new space 
of possibilities for system architectures. In Sect. 3.5, we iden- 
tify the regions of this architecture space that correspond to the 
four architecture classes introduced above. As this framework 
deals with the architecture of multi-planetary system, systems 
with only one planet are not studied within this framework. 


3.3. Coefficient of similarity 


The term ‘coefficient of similarity’ is commonly used in the 
fields studying statistics of ecology and genetics (Gower 1971; 
Dalirsefat et al. 2009). We borrow the term but develop our 
own concept and definition.Let q be a planetary quantity such 
as mass, size, period ratios of adjacent planets, bulk density, ec- 
centricity, and so onf. The value of this quantity for the i'" planet 
in a system is denoted by q;. The coefficient of similarity, Cs, 
measures how q changes from one planet to another, inside-out. 
For a system with n planets, it is defined as: 


i=n-1 
[vo diel } (1) 
i=l di 


There is a clear physical interpretation for Cs (q): the coefficient 
of similarity measures the average order of magnitude variation 
in the quantity q from one planet to another. The definition of 
the coefficient of similarity allows us to map the architecture of 
a planetary system on a one dimensional axis. When Cs(g) « 
0, then the system's architecture could imply a similarity in q. 
When C; (q) is positive, then planets within a system are ordered 
in q. Conversely, Cs (q) being negative, implies that the planets 
are anti-ordered. 

We have developed a mathematical formalism to study the 
sensitivity of the coefficient of similarity. In Appendix C, we de- 
rive the limiting values of the coefficient of similarity and present 
the results here. For example, when the q; values for all planets 
in a system are within 10% of each other, then the maximum 
possible value of Cs(q) is 0.09 (see Eq. C.10). For maximum 
tolerances of 20%, 40%, 60%, and 80%, the maximum possi- 
ble value of Cs (g) are 0.18, 0.37, 0.60, and 0.95 respectively. 
In Fig. C.1, we show the dependence of the max Cs (q) on t. 

The coefficient of similarity cannot distinguish between two 
classes of architecture: similar and mixed. Systems which show 
similarity will have Cs(q) ~ 0. However, system with mixed ar- 
chitecture have large increasing and decreasing variations, such 


^ For quantities which admit zero as a possible value, the coefficient of 
similarity may become ill-defined. This is a coordinate singularity and 
can be dealt with an appropriate treatment (see Eq. 4 Sect. 5.4). 
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that the log of ratios cancels itself out. Such systems will also 


have Cs(q) « 0. We propose the coefficient of variation to distin- 
guish these two architecture classes. The coefficient of similarity 
depends on the actual order in which planets exist (inside-out) in 
a system. As we go on to show, the coefficient of variation does 
not depend on the ordering of planets in a system. 


3.4. Coefficient of variation 


The coefficient of variation, Cy, is a standard descriptive statistic 
used to measure the magnitude of variation in a set of numbers 
(Katsnelson & Kotz 1957; Sharma et al. 2010; Abdi 2010). It is 
defined as the ratio of the standard deviation with the mean: 


Cy(q) = 79. Q) 


The coefficient of variation is a positive quantity. When all 
qi have the same value then Cy(q) = 0. Planetary systems con- 
sisting of planets that have a small (or large) variability in their 
q; values will have a small (or large) value of the coefficient of 
variation. Now, the distinction between systems showing simi- 
larity and mixed architecture is clear. While similar systems will 
have a low value of the coefficient of variation, mixed systems 
will have a high value of coefficient of variation. 

Since this coefficient is a well known statistical measure, 
there are some derivations for its limit. A classical result from 
Katsnelson & Kotz (1957) shows that, for a set of n numbers, 
the maximum value of the coefficient of variation is Vn — 1. 
However, this result is only a particular case in our setup. In Ap- 
pendix C, we develop a mathematical formalism to understand 
the limits of the coefficient of variation and present the results 
here. When the q; values for all planets in a system are within 
10%, 30%, 50%, 70%, and 90% of each other, the absolute the- 
oretical upper limit of Cy(q) is 0.10, 0.31, 0.58, 0.98, and 2.06 
respectively. Figure C.1 shows how this upper limit varies with 
the maximum tolerance, f, for a system. 


3.5. Classifying the architectures of planetary systems 


We are interested in obtaining a mapping from the scale- 
invariant coefficients to an architecture class. In Appendix D, 
we present some considerations that motivate the selection of 
boundaries between the four classes. The selected boundaries 
were additionally tested on thousands of mock planetary systems 
to check their ability to correctly classify the four architecture 
classes. We propose the following boundaries for identifying the 
architecture class based on planetary masses. 


Architecture class Condition 
Anti-ordered Cs(M) « -0.2 
Ordered Cs(M) > +0.2 
yn- 1 
Similar ICs (M)| < 0.2 and Cy(M) < E 
yn- 1 
Mixed ICs(M)| € 0.2 and Cy(M) > — (3) 


2 


A natural (and welcome) outcome of these criteria is that a 
two-planet system can never have a mixed class architecture. The 
boundary between similar and mixed class is half the maximum 
possible value of the coefficient of variation. For the solar sys- 
tem, Cs(M) = 0.36 and Cy(M) = 1.85. This framework robustly 
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Fig. 3. New parameter space: Architectures of planetary systems. Both panels shows the coefficient of similarity (mass) as a function of the 
coefficient of variation (mass). The shaded regions show the allowed parameter space for planetary systems. The white gaps (between two shaded 
regions) mark the mathematically forbidden regions of this architecture space. Different parts of this parameter space are identified with four 
architecture classes, in accordance with Eq. 3. Each point corresponds to an individual planetary system. For visual clarity, the shaded and 
unshaded regions are drawn only for systems hosting up to fifteen planets. Left: Planetary systems from the Bern model and observations. Right: 
Synthetically observed systems depicting the detection biases of radial velocity and transit surveys. 


identifies the architecture of the solar system as ordered’. This 
classification is in line with the historic understanding of the so- 
lar system architecture: small rocky planets on the inside and 
giant planets on the outside. If, however, Neptune were replaced 
with an Earth-like planet, the architecture of the solar system 
would be classified as mixed. Considering only the inner four 
planets of the solar system, Cs (M) = 0.10 and Cy(M) = 0.85, 
would make the architecture of the inner solar system belong to 
the similar class. The architecture of the outer four giants in the 
solar system is anti-ordered and we have Cs(M) = —0.42 and 
Cy(M) = 1.11. 

Figure 3 shows the Cs(M) versus Cy(M) space for plane- 
tary systems from several catalogues. The Bern model planetary 
systems occupy all four regions of this architecture space. Ob- 
served planetary systems, however, span only a limited region 
of this parameter space, given the low multiplicity of observed 
planetary systems. The architecture space spanned by the ob- 
served planetary systems (shaded contour) is in agreement with 
the synthetically observed planetary systems from Bern Com- 
pact Multis, Bern KOBE Multis, and Bern RV Multis. 

The architecture for the systems in the synthetically observed 
catalogue was calculated based only on the planets that were de- 
tected (for RV/KOBE) or included (for Bern Compact Multis) in 
the above-mentioned catalogue. It is theoretically possible for a 
single Bern model system to exhibit different architectures de- 
pending on the planets which are detected or included. The re- 
verse is also true — the architecture of an observed planetary sys- 
tem may change if new planets are discovered or old controver- 
sial candidates are rejected. While the ground truth architecture 
for observations seems elusive, a comparison with synthetic ob- 


? Even if the masses of each solar system planet were randomly varied 

within 8596 of their original values, the emerging architecture is still 
ordered. With 1M trials, varying the masses randomly within 90% of 
their original values lead to ordered (for ~ 99.45% trials), mixed (for 
= 0.55% trials), and similar (for ~ 0.001% trials) architectures. 


servations can bring forth patterns which are unexpected. With 
this in mind, we consider the following example. 


Detection biases, in both radial velocities and transits, gen- 
erally disfavour the discovery of less-massive and small planets 
at larger distances. This implies that anti-ordered architectures 
are difficult to detect. In fact, we have no known example of a 
planetary system showing anti-ordered architecture in our obser- 
vations catalogue. This is surprising for two major reasons: (a) 
theory suggests their existence: there are several synthetic plan- 
etary systems from the Bern Model whose architecture is anti- 
ordered; (b) theory suggests their discovery: all three syntheti- 
cally observed catalogues contain some (albeit few) anti-ordered 
planetary systems. Since the number of systems in our catalogue 
is too low, we refrain from making any conclusions and, instead, 
we await the discovery of anti-ordered architectures in the future. 
However, if such architectures are not found despite considerable 
efforts, this result will become a strong indicator for shaping our 
understanding of planet formation. 


Another aspect of this new architecture space is the underly- 
ing mathematical structure. In Fig. 3, the shaded areas shown 
regions where a planetary system, with n € [2, 15] planets, is al- 
lowed. A system with two planets, for example, can only occupy 
the shaded region labelled ‘n = 2’. All non-shaded regions (in 
white — except the shaded regions for 16 or more planets which is 
not drawn), on this architecture space, is mathematically forbid- 
den. These are parts of the architecture parameter space that no 
planetary system, irrespective of its configuration, can occupy. 


$ Visualizing this structure is easy (not shown). (a) Construct mock 
planetary systems with masses, for each mock planet, randomly drawn 
from a uniform distribution with suitable limits. (b) It is suggested to 
vary the number of planets in these mock systems randomly. (c) Calcu- 
late the C; (M) and the Cy(M) using equations 1 and 2. (d) Plot C; (M) 
versus Cy(M) for this mock population. For large number of systems 
the plot should be symmetric about Cs (M) = 0. 
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This strong result stems from the mathematical limits that were 
derived for this work (see Sects. 3.3, 3.4, and appendix C). 

For clarity and future convenience, we introduced some ter- 
minology to the method. When the architecture framework (i.e. 
Cs and Cy) is applied on planetary bulk masses, the resulting in- 
formation tells us the mass architecture of a system, namely, the 
arrangement and distribution of masses in said system. Similarly, 
when this framework is applied on radii, it gives us the radius ar- 
chitecture (arrangement and distribution of radii) for the system 
(Sect. 5.1). Similarly, we can obtain the bulk-density architecture 
(Sect. 5.2), core-mass architecture (Sect. 5.3), water mass frac- 
tion architecture (Sect. 5.4), period-ratio or spacing architecture, 
eccentricity-architecture, and so on. In this series of papers, we 
identify a system’s architecture based on its bulk mass ar- 
chitecture. Thus, when a system is said to be similar, we are 
referring to the similarity in terms of the mass architecture. 


4. Characteristics of architecture classes 
4.1. General comments 


In earlier studies on the peas in a pod architecture, the strength of 
population-level (i.e. across many planetary systems) trends was 
quantified using Pearson correlations coefficient (Weiss et al. 
2018; Zhu 2020; Chevance et al. 2021; Millholland & Winn 
2021; Mishra et al. 2021). The correlation coefficients were cal- 
culated using planetary quantities in the log space (i.e. by first 
taking the /ogio of all quantities). This resulted in higher values 
of the correlation coefficient since quantities have limited range 
to perambulate in the log space. Consider planetary masses. We 
calculated the correlation coefficient between the mass of adja- 
cent inner and outer planets in the Bern model population (see 
Fig. 7 in Mishra et al. 2021). The value of the coefficient is 0.66 
in the log space and 0.16 in the linear space. This highlights that 
the planetary masses are more closely clustered in log than in 
linear space. 

We tested the same correlation for all systems in each ar- 
chitecture class. We expect planetary masses in mixed, ordered, 
and anti-ordered systems should (by definition) have low cor- 
relations. On the other hand, similar class architecture should 
exhibit a strong correlation. Surprisingly, in log space all archi- 
tecture classes show strong correlations. The coefficient value 
is 0.67 for similar class, 0.69 for mixed class, 0.50 for ordered 
class, and 0.58 for anti-ordered class architectures. However, in 
the linear space the coefficient values reflects our expectation: 
0.61 for the similar class, 0.20 for the mixed class, 0.16 for or- 
dered class, and 0.05 for anti-ordered class. This underscores 
that strong correlations in the log space may not be indicative of 
substantive architecture trends. It also shows that our framework 
is capable of identifying systems in which the 'peas in a pod’ 
architecture is discernible even in the linear space. 

For all 41 observed planetary systems in our catalogue, we 
report their architecture classes in Table 2. Figure 6 shows the 
architecture of all observed multi-planetary systems in our cat- 
alogue. The systems are sorted by their coefficient of similarity 
values. The figure also shows the four classes of architecture for 
a few randomly selected synthetic planetary systems. To under- 
stand the characteristics of the different architectures, we study 
the distribution of planetary masses, radii, and semi-major axes 
as well as the multiplicity distributions. For planetary systems 
across all catalogues, this is shown in Fig. 7. We describe the 
characteristics of different architectures in the following subsec- 
tions. The discussion in the next subsection involves results de- 
rived from both observed and synthetic planetary systems. In ad- 
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Table 2. Architecture type of known multi-planetary systems (see Table 
1 for catalogue and Fig. 6 for architecture plot). 


Hostname Multiplicity Cs(M) — Cy(M) Architecture Class 
Solar System 8 +0.36 1.85 Ordered 
Trappist-1 7 -0.10 0.45 Similar 
TOI-178 6 +0.08 0.46 Similar 
HD 10180 6 +0.14 0.66 Similar 
HD 219134 6 +0.27 1.49 Ordered 
HD 34445 6 +0.17 0.84 Similar 
Kepler-11 6 +0.22 1.03 Ordered 
Kepler-20 6 +0.00 0.44 Similar 
Kepler-80 6 —0.00 0.19 Similar 
K2-138 6 +0.03 0.61 Similar 
55 Cnc 5 +0.52 1.37 Ordered 
GJ 667 C 5 —0.02 0.29 Similar 
HD 158259 5 +0.11 0.29 Similar 
HD 40307 5 +0.07 0.33 Similar 
Kepler-102 5 +0.02 0.41 Similar 
Kepler-33 5 +0.46 0.67 Ordered 
Kepler-62 5 +0.15 0.68 Similar 
HD 20781 4 +0.29 0.59 Ordered 
TOI-561 4 +0.33 0.64 Ordered 
DMPP-1 4 +0.29 0.81 Ordered 
GJ 3293 4 +0.27 0.62 Ordered 
GJ 676A 4 +0.90 0.99 Ordered 
GJ 876 + +0.12 1.20 Mixed 
HD 141399 4 +0.06 0.40 Similar 
HD 160691 4 +0.63 0.82 Ordered 
HD 20794 + +0.08 0.25 Similar 
HD 215152 + +0.07 0.23 Similar 
HR 8799 4 —0.07 0.17 Similar 
K2-266 4 +0.03 0.60 Similar 
K2-285 4 +0.01 0.31 Similar 
Kepler-89 4 +0.17 0.91 Mixed 
Kepler-106 4 +0.11 0.26 Similar 
Kepler-107 4 +0.13 0.42 Similar 
Kepler-223 4 —0.06 0.22 Similar 
Kepler-411 4 —0.08 0.34 Similar 
Kepler-48 4 +0.74 1.64 Ordered 
Kepler-65 4 +0.68 1.63 Ordered 
Kepler-79 4 —0.10 0.24 Similar 
WASP-47 4 +0.59 0.95 Ordered 
tau Cet 4 +0.12 0.37 Similar 
HD 164922 4 +0.49 1.29 Ordered 


dition, we present a gallery of mass-distance diagrams showing 
the four architecture classes in Appendix E. 


Figure. 4 shows the coefficient of similarity of masses as a 
function of the total planetary mass in a system for all synthetic 
planetary systems from the Bern model. This figure shows sev- 
eral key aspects. Firstly, it illustrates the four architecture classes 
as separate clouds of scattered points strengthening the proposed 
four classes of planetary system architecture. Secondly, it shows 
that the architecture framework is scale-invariant, that is, the 
system architecture is sensitive only to the relative distribution 
of a quantity — and not its absolute value. For example, while 
most similar system have g 100Mg mass in their planets (sug- 
gesting a lack of giant planets), there are some similar systems 
with mass values of ~ 2000Mg for their planets and host giant 
planets. Likewise, most ordered systems host giant planets and 
have z 2000M, mass in their planets, there is an ordered sys- 
tems without any giant planets. Also, it illustrates that the coeffi- 
cient of similarity partitions planetary systems into three groups: 
anti-ordered, similar and mixed in one group, and ordered. This 
demonstrates that the coefficient of variation is necessary to dis- 
tinguish between the similar and mixed systems. Finally, the di- 
agram shows that the architecture class of a system has strong 
links with the total mass of planets in the system. This hints that 
there must be general patterns in the formation pathways of sys- 
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Fig. 4. Four classes of system architecture. The diagram shows the coef- 
ficient of similarity for a system as a function of the sum of mass of each 
planet in a system. Dashed horizontal lines correspond to C; = +0.2. 
This diagram emphasises the four classes of planetary system architec- 
ture, namely: anti-ordered, similar, mixed, and ordered. It also shows 
that the coefficient of similarity can not distinguish between similar and 
mixed architectures. 


tems of the same architecture. This topic is discussed in Paper II, 
from this series. 


4.2. Frequency of architecture 


The frequency of each architecture class across all catalogues is 
shown in Fig. 5. Similar systems are the most common archi- 
tecture classes emerging from simulations, with a frequency of 
= 80.2%. About « 8% of synthetic systems show mixed and 
anti-ordered architectures. Ordered architecture is a rare out- 
come in simulations (z 1.596). In observations, similar class is 
the most common architecture (~ 59%). Fifteen observed exo- 
planetary systems (out of forty-one) are part of the ordered ar- 
chitecture class (x 37%). About ~ 5% of observed planetary 
systems show mixed architecture. There are no known examples 
of observed system with anti-ordered architecture. 

Comparing the frequency of architecture classes for ob- 
served systems with synthetically observed systems brings out 
some peculiar features. Firstly, theoretical catalogues seem to 
suggest that observations should find more similar systems and 
fewer ordered systems. The frequency of similar (ordered) sys- 
tems in our observed catalogue is significantly lower (higher). 
Secondly, while the frequency of mixed systems seems to be in 
agreement with synthetic observations, this agreement is not sta- 
tistically significant. 

These discrepancies probably arise from the incompleteness 
prevalent in our observations catalogue. Transit surveys are con- 
ducted in a manner which allows the completeness and reliability 
of these survey to be estimated. The completeness of RV surveys, 
on the other hand, is very difficult to estimate. Further, the obser- 
vation techniques used to find the exoplanets in our observations 
catalogue are heterogeneous, consisting of RV, transits, transit- 
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Fig. 5. Frequency diagram for the architecture classes. Currently, there 
are no known examples of observed planetary systems with anti-ordered 
architecture. The length of error bars visualises the total number of sys- 


tems in each bin as: 100/ vbin counts. 


timing variations, and direct imaging; this complicates the es- 
timation of completeness. The PLATO mission is an upcoming 
space mission that is equipped to allow for statistical estimates 
of cosmic occurrence rates of planetary system architecture in 
our galaxy (Rauer et al. 2014). If more exoplanetary systems are 
uniformly detected and characterised, then it would be possible 
to estimate the occurrence rate of the different classes of system 
architecture. While such a result would constitute an important 
knowledge about our Universe, it could also become an excel- 
lent way of constraining our knowledge of initial conditions for 
planetary formation and the physical processes which shape the 
system architecture. The frequency of architecture class in sim- 
ulations is a direct consequence of the initial conditions and the 
physical processes modelled in the Bern model. 


4.3. Architecture class: similar 


Planetary systems have a similar architecture when all planets in 
the system have masses that are approximately similar to each 
other. These planetary systems are the archetypical examples of 
the peas in a pod trend. There are several well-known planetary 
systems exhibiting similar architecture, such as Trappist-1 (Agol 
et al. 2021), TOI-178 (Leleu et al. 2021), Kepler-20 (Buchhave 
et al. 2016), and so on. This architecture is the most common 
outcome of planetary formation and is also the most frequent 
architecture class in our observed catalogue. 

Similar systems in the Bern model are composed of several 
low-mass planets. They tend to have limited diversity in plan- 
etary masses when compared with the observed systems. The 
mass distribution, for similar systems in the Bern model, shows 
that there are many low-mass (< 1 Mg) planets in these systems. 
This peak is missing in observations as well as synthetic observa- 
tions as low mass exoplanets are difficult to observe. This could, 
however, be remedied in future as current radial velocity spectro- 
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Fig. 6. Architecture plot showing the architecture of observed (left) and randomly selected synthetic planetary systems (right). Each row is for one 
planetary system and the circles in that row represent planets. The area of the circle encodes planetary mass, and the colour shows the equilibrium 
temperature. The coefficient of similarity for each system is shown on the right y-axis. The x-axis shows the semi-major axis, which is different 
for the two panels. 


graphs reach the ~ 20 cm/s precision necessary for discovering systems implies that these systems are prominently composed of 
exoplanets in the super-Earths and Earths mass range (Lillo-Box rocky planets, super-Earths and sub-Neptunes’. 
et al. 2021; Netto et al. 2021). The radius distribution of similar 


7 Throughout this paper, we use planetary classes (e.g. rocky, super- 
Earths, etc.) from the radius based classification of Kopparapu et al. 
(2018) 
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Fig. 7. Characteristics of the architecture classes. These plots show the distribution of various quantities (columns) as function of different cata- 
logues (rows). Left to right: Distributions of mass, radius, distance, and multiplicity in the following catalogues (top to bottom): Bern model, Bern 
RV Multis, Bern KOBE Multis, Bern Compact Multis, and observations. All catalogues are described in Sect. 2. Some notable features from these 
plots are discussed in Sect. 4. All individual distributions are normalised such that the area under each curve sums to unity. The dotted vertical line 
in the radius distributions marks 1.75R& — approximately, the location of the well-known gap in the radius distribution (Fulton et al. 2017). Since 
there are only two mixed systems with the same multiplicity (n = 4) in our observations catalogue, a vertical line replaces the density kernel. The 
Gaussian density kernels in all other cases were estimated using Scott's rule (Scott 2015). 
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The Bern RV Multis show a bimodal planetary distance dis- 
tribution for similar systems (as well as for mixed and ordered). 
The approximate location of the gap is 0.28 au or 55 d (for a solar 
mass star). This bi-modality is not visible in our observed cata- 
logue. Planets in similar and mixed systems in the Bern Model 
also show a dip around this location. In the Bern Model, in- 
wardly migrating giant planets (= 100 Mg) tend to stop around 
0.4 au or 100 d. Inside this region, low-mass planets are popu- 
lous. We attribute this bi-modality to these two populations of 
planets. This bi-modality probably arises because planets switch 
their orbital migration from type I to type II depending on their 
masses (Emsenhuber et al. 2021a). This bi-modality cannot be 
seen in Bern Compact Multis because we only include planets 
with periods less than 100d. For Bern KOBE Multis, the com- 
pleteness of the Kepler mission for large distant planets is poor 
(see Fig. C.2 in Mishra et al. 2021). However, a dip at this loca- 
tion in Bern KOBE Multis is visible. It would be interesting to 
see if such a bi-modality is also present in the Kepler catalogue. 
We tested the significance of this bi-modality with Hartigan’s dip 
test (Hartigan & Hartigan 1985). The dip test is suggestive of the 
bi-modality for the Bern RV Multis and Bern KOBE Multis (p- 
value < 0.05) and insignificant for the other catalogues. 


A system’s architecture is sensitive only to the relative distri- 
bution of a quantity (such as mass) amongst its planets and not 
the absolute distribution. HR 8799 offers an example (Marois 
et al. 2008) as a relatively young system with four directly im- 
aged giant planets. Our framework identifies the architecture 
of this systems as similar. Most observed similar systems are 
composed of low-mass planets ($ 100Mẹ), making HR 8799 a 
unique exception. This shows that the architecture framework is 
sensitive only to the relative variations in the mass. Additionally, 
there are only two systems (out of 1000) in our simulated cata- 
logue where a similar architecture arises from only giant planets. 
Even then, these two synthetic systems have only two giant plan- 
ets much closer to the star than the HR 8799 planets. The Bern 
Model does not produce many HR 8799-like systems. This sug- 
gests that a system with similar architecture made up of only 
giant planets is probably rare. One possibility could be that sys- 
tems (e.g. HR 8799) with such architecture are probably diffi- 
cult to form via core accretion pathway (Konopacky & Barman 
2018). Such systems may require additional formation mecha- 
nisms such as protoplanetary disk instabilities (Schib et al. 2021; 
Boley et al. 2010; Kratter et al. 2010). 


4.4. Architecture class: mixed 


Planetary systems where the planetary masses (inside-out) show 
broad increasing and decreasing variations have mixed archi- 
tecture. GJ 876 and Kepler-89 host planetary systems with a 
mixed class architecture. GJ 876 is an M dwarf low luminous 
(~ 0.01 Lo) star hosting four planets with masses between 
8 — 888Me. The outer three planets are in a Laplace mean- 
motion resonance (Millholland et al. 2018). Kepler-89, on the 
other hand, is an early F, highly luminous (z 3.5 Lo) star. It hosts 
a compact four planet system with masses between 10 — 100Ms. 
Despite the starkly different stellar properties, the architecture 
of these two systems is analogous: Cs(M) = 0.12 and 0.17, 
Cy(M) = 1.2 and 0.9, respectively. While the coefficient of simi- 
larity is low for both systems, the coefficient of variation is larger 
than 3/2, which helps us identify the architecture of these sys- 
tems as mixed class. Indeed, Fig. 6 indicates that this identifica- 
tion is correct. 
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The frequency of this architecture class in the Bern model 
is ~ 8.2%. The Bern model’s synthetic mixed architecture 
planetary systems (Fig. 6 right) tend to have numerous Earth- 
mass planets outside 10 au. This parameter space (mass-distance 
plane, Fig. 1), however, remains inaccessible to most exoplanet 
detection techniques. These systems are also composed of super- 
Earths, sub-Neptunes, Neptunes, and Jovian planets. The bi- 
modality in distance distribution (discussed before) is prominent 
for these architectures in Bern RV Multis. We found a Harigan's 
dip statistic of 0.03 and p-value of ~ 0.2 (Hartigan & Hartigan 
1985). 


4.5. Architecture class: anti-ordered 


Planetary systems where the planetary mass shows an overall 
decrease with distance have an anti-ordered architecture. There 
are no observed examples of this architecture class in our cata- 
logue. The frequency of this architecture class in the Bern model 
is & 8.496. About x 4% of systems in Bern KOBE Multis, 
=~ 3.2% of systems in Bern Compact Multis, and « 1.2% of 
systems in Bern RV Multis have this architecture. This shows 
that it is an observationally challenging system architecture to 
detect. However, even if 196 of observed exoplanetary systems 
are Anti-Ordered we should already have found about 30-40 
such systems. More work is necessary to identify the handful 
of these systems from the already observed systems. Many cur- 
rently known single hot Jupiter systems may host additional 
small, distant, and as yet undetected planets — revealing these 
potentially anti-ordered systems. 


Anti-ordered systems in the Bern Model are mostly com- 
posed of low mass planets € 5Mg and giants z 100Mgs. In 
the Bern Model, the radius distribution of this architecture class 
peaks for Rocky and Super-Earths planets. It decreases for sub- 
Neptunes and Neptunes and then increases again for Jovian plan- 
ets. Many of the low-mass planets that make up this architecture 
class are outside 10au, making their detection very challenging. 
The multiplicity distribution shows that these systems tend to 
have fewer planets than similar or mixed architecture. This is 
an indication that the formation pathway of these architectures 
differs considerably from the other two types of architecture. 
Planets from anti-ordered architectures show a weak distance bi- 
modality feature (discussed earlier in this work). This is under- 
standable since these architectures consist of massive planets in 
the inner parts and less massive planets in the outer parts of the 
system. The distance bi-modality seems to arise from low mass 
planets (migrating via type I) inside 0.28au or 55 days and giant 
planets (migrating via type II) outside 0.28au or 55 days. This 
adds further strength in attributing the distance bi-modality to 
planetary migration. 


4.6. Architecture class: ordered 


Planetary systems where the planetary masses shows an overall 
increase with distance have an ordered architecture. The increas- 
ing mass may be monotonic (e.g. TOI-561, HD 20781, DMPP- 
1,HD 160691, HD 164922) or non-monotonic (e.g. the Solar 
System, Kepler-11, 55 Cnc, Kepler-48, Kepler-65). Ordered ar- 
chitecture is a rare outcome for the Bern model. Observations 
are generally biased against discovering small and less massive 
planets which are farther away from their host star. Such biases, 
however, make ordered systems the second most common archi- 
tecture class. Fifteen systems in our catalogue exhibit this archi- 
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tecture. Unsurprisingly, the most notable known example of this 
architecture class is the Solar System. 

The mass and radius distributions of ordered architecture in 
the Bern Model shows considerable difference from other archi- 
tecture. The mass distribution peaks around 1000M 9. Most of 
the Bern model’s ordered systems tend to have at least one gi- 
ant planet. These systems are also composed of sub-Neptunes, 
Neptunes, and Jovian planets. 


5. Internal composition across architecture classes 


So far we have seen the new architecture framework (Sect. 3) and 
some characteristics of the four classes of architecture (Sect. 4). 
In this section, we study the connection between the bulk mass 
architecture classes and the internal composition of the planets. 
This section demonstrates that the same architecture framework 
can be used to study the multi-faceted nature of planetary sys- 
tem architecture — from bulk mass architecture to density archi- 
tecture. We study several different aspects of the planetary in- 
ternal composition: (a) radius architecture (Sect. 5.1); (b) bulk 
density architecture (Sect. 5.2); (c) Core/Envelope mass archi- 
tecture (Sect. 5.3); and (d) fraction of volatiles and water ice in 
core architecture (Sect. 5.4). We explore these connections for 
planetary systems in the simulated (Bern model) and syntheti- 
cally observed catalogues (Bern RV Multis, Bern KOBE Mul- 
tis, Bern Compact Multis). All results in this section are derived 
from synthetic planetary systems only. 


5.1. Radius architecture 


Weiss et al. (2018) showed that the size of adjacent exoplanets 
were similar — coining the phrase “peas in a pod’ to describe this 
architecture. Millholland et al. (2017); Wang (2017) extended 
these ideas to planetary masses, showing that the masses of ad- 
jacent planets are also correlated. In Mishra et al. (2021), we 
suggested that the peas in a pod trends in terms of size effec- 
tively emerge from the mass trends. Here, we attempt to set our 
assumption on firmer ground. 

Figure 8 (top) shows the coefficient of similarity for radii as 
a function of the coefficient of similarity of masses, for systems 
with two or more planets. This allows us to compare the system- 
level radius architecture with the system-level mass architecture. 
We easily see that most systems seem to follow a linear rela- 
tionship. The Pearson correlation coefficient is 0.89, indicating 
a strong positive correlation between the mass and radius archi- 
tecture. The coefficient value increases to 0.96, when systems 
with only three or more planets are considered. Since the mass- 
radius relation is not a bijective function (i.e. one-to-one corre- 
spondence), there are some systems that show a strong deviation 
from the linear relation. 

Figure 8 (bottom) shows the radii architecture for the syn- 
thetic planetary systems?. This shows that most systems that 
are ordered (or anti-ordered) in mass are also ordered (or anti- 
ordered) in terms of radius. The figure also shows that systems 
which are similar or mixed in mass architecture have Cs(R) ~ 0. 
Systems with mass similarity have lower Cy(R) compared to sys- 
tems with mass mixture, suggesting that for most systems, the 


* A future study could investigate the boundaries for robust architec- 
ture identification, as in Eq. 3, but based on radius instead of mass. 
Such a classification is readily applicable since radius measurements 
tend to be uniformly available and are better agreed upon amongst sev- 
eral observers. Data-driven approaches such as machine learning could 
be useful in such an endeavour. 
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Fig. 8. Radii architecture. Top: The diagram shows the coefficient of 
similarity of radii as a function of the coefficient of similarity of masses, 
for synthetic and observed planetary systems. The dashed line shows the 
corresponding linear fit. Bottom: Radius architecture of synthetic plane- 
tary systems contrasted with the mass architecture. In the bottom panel, 
the marker colour and shape indicates the bulk mass architecture of a 
system and its position on the diagram suggests its radii architecture. 


radius architecture closely follows the mass architecture. At the 
planetary level the radius of a planet is correlated with its mass 
via the planet's chemical composition (Lopez & Fortney 2014). 
Our architecture framework shows that such relationships also 
exist at the system level. A few mass-ordered systems show sim- 
ilarities in radius. These few systems have the following com- 
mon features: two mass-ordered giant planets with similar sizes 
(masses ~ several M,’s, and radius ~ 1R,). This illustrates that 
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while mass architecture and radius architecture are related, they 
are not always identical. 


We conclude that the peas in a pod radius correlations gen- 
erally arise from the underlying mass architecture. We consider 
the mass architecture primal because planets, foremost, accrete 
mass from the protoplanetary disk and, consequently, are char- 
acterised by a size that is in accordance with their internal struc- 
ture. 


5.2. Density architecture 


Bulk density (or simply density) is a directly measurable quan- 
tity which is sensitive to the internal structure of a planet. 
This makes density an important characteristic for understand- 
ing planetary structure. The density of a planet depends on 
many parameters and many physical processes. For example, a 
planet’s mass may depend on its accretion history, starting loca- 
tion, amount of material in disk, competition with other planets, 
and so on. Giant impacts may also affect a planet’s density, as 
explained in Bonomo et al. (2019). In this section, we study the 
arrangement and distribution of planetary density around their 
host star, namely, the density architecture of a system. 


Figure 9 (left) shows the density of a planet, simulated via the 
Bern model, as a function of its mass and starting location. The 
figure also shows the density of solar system planets and few ob- 
served exoplanets (from our catalogue). The plot can be roughly 
divided into two halves: (a) planets with a mass of < 100 Mg and 
(b) planets with a mass of > 100 Mg. In our simulations, most 
planets which started inside the ice line tend to have terrestrial 
Earth-like densities. These planets are 0.5 — 3Rg and g 10Ma. 
Planets starting around or outside the ice line generally accrete 
more volatile rich material and H/He gas. These planets have 
lower densities due to their larger sizes. Planet which started 
outside the ice line (3-10 au) show a broad diversity in their den- 
sities. As they accrete more gases, their density decreases fur- 
ther. These planets are roughly 2 — 10 Rg and are characterised 
by masses that vary by four orders of magnitude. Planets more 
massive than 100 Mg seem to lie on a single curve. Since the size 
of these planets remains the same (~ 1R; or 11Rg,), their densi- 
ties increases linearly with their masses. Planets that started in 
the outer regions (30-40 au) cluster on the density-mass plane. 
These planets have low densities (« 2g/cm?) and low masses 
($ 1 Mo). 

The density architecture for simulated systems in the Bern 
Model is shown in Fig. 9 (middle). An important relation be- 
tween mass architecture and density architecture is seen. Some 
systems which are ordered (or anti-ordered) in mass are also or- 
dered (or anti-ordered) in density, that is, these systems have 
large positive (or negative) Cs(p). In other words, simulations 
suggest that planetary systems can also be ordered or anti- 
ordered in density. A system is ordered in density when the in- 
ner planets have small densities and the outer planets have larger 
densities — and vice-versa for density anti-ordered systems. Sys- 
tems with mass architectures of similar and mixed are strongly 
clustered around Cs(p) « O and Cy(p) « 1. The inset shows 
that similar mass systems tend to have small Cy(p), while mixed 
mass systems have larger Cy(p). This implies that some systems 
that are similar (or mixed) in mass show some similarity (or mix- 
ture) in density. A system with a similar density architecture will 
host planets that have approximately similar densities. However, 
the region Cs(p) « Cy(p) « 0 is empty, indicating the absence 
of planetary systems where the density of planets (inside out) 
is approximately the same. While there are exceptions, overall, 
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for many systems, the density architecture seems to follow their 
mass architecture. 

This approximate link between the mass and density archi- 
tecture stems from massive planets (> 100 Mg) whose densities 
increase with their mass (see 9 (left)). Systems which do not host 
any massive planet are mostly similar in their mass architecture 
and have Cs (p) « 0. The inset shows that the Aryabhata's num- 
ber increases as a system approaches the Cs(p) ~ Cy(p) ~ 0 
region (see Paper II for the definition of Aryabhata's number). If 
a system has more surviving planets that started from inside the 
ice line, then the densities of these planets will be more similar to 
each other. This means that the density architecture of a system 
shows some dependence on the starting location of a planet. 

We also investigated if the relation between the mass and 
density architectures is observable. Figure 9 (right) shows the 
density architecture for systems from our synthetically observed 
catalogues. Also shown is the density architecture of some ob- 
served exoplanetary systems for which the mass and radius mea- 
surements were available. The density architecture of syntheti- 
cally observed catalogues shows a trend which is quite unlike 
Fig. 9 (middle). There is an unexpectedly good agreement be- 
tween the synthetically observed systems and the observed plan- 
etary systems. We attribute the peculiar shape of this plot to the 
difficulty of detecting distant planets. Transit and RV observa- 
tions favour the detection of planets within ~ 1 au. Many close- 
in planets tend to have Earth-like densities, while planets far- 
ther out have lower densities (due to either their volatile rich or 
gaseous composition). Overall, this would lead to an observed 
density architecture where inner planets have higher densities 
and outer planets have lower densities. A situation such as this 
will be characterised by negative Cs(p), which is readily seen 
from Fig. 9 (right). 

In summary, many synthetic systems show a relationship be- 
tween their mass architectures and their density architectures. 
Bern model systems that are ordered or anti-ordered in their 
mass also tend to be ordered or anti-ordered in their densities. 
The dispersion of planetary bulk densities in similar class sys- 
tems is lower than mixed class systems. This relation seems to 
emerge from massive planets whose densities increases linearly 
with their masses (since they cannot grow their sizes any more). 
These relations can be considered as a prediction from this work. 
As future observations probe the outer parts of an exoplane- 
tary system, we may anticipate the discovery of several systems 
whose mass and density architectures are closely linked. 


5.8. Core and envelope mass architecture 


In this section, we show that (a) most simulated planetary sys- 
tems inherit their architecture from the underlying core mass 
architecture; (b) the accretion of gases tends to accentuate the 
underlying core mass architecture, and (c) the observed mass ar- 
chitecture of a planetary system is a gateway to studying the core 
mass architecture of the system, since the two are strongly cor- 
related. Exceptions to the first two statements tend to arise for 
those systems undergoing strong, multi-body dynamical effects 
such as planet-planet scattering. 

The fraction of mass which is partitioned into a planet's core 
and its envelope is governed by planetary formation physics. The 
end result is dictated by an interplay of several concurrent pro- 
cesses (see Emsenhuber et al. 2021a; Mordasini et al. 2012b, 
for discussion). In the core-accretion scenario, giant planets are 
formed when planetary cores can undergo run-away gas accre- 
tion (Pollack et al. 1996; Alibert et al. 2004, 2005). Proto-planets 
that have failed to trigger runaway gas accretion comprise a di- 
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coefficient of similarity (left) and coefficient of variation (middle). In the left panel, the points corresponding to similar systems are very tightly 
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colour. The right panel plots the coefficient of similarity for masses and core masses for systems in the synthetically observed catalogues. 


verse group of planets: Earths, Super-Earths, mini-Neptunes, and 
Neptunes. 

The bifurcation of a planet's mass into its core and its enve- 
lope can probe the formation history. For example, in our sim- 
ulations, most giant planets (z 1 Mj) have about 1% of their 
mass in their cores and the rest is in their gassy envelope. On 
the other hand, low mass planets (g 10 Mg) hardly accrete any 
gaseous envelope. However, the mass in a planet's core and en- 
velope is not an observable. Even for the solar system planets, 
internal structure models guide our knowledge of core and enve- 
lope masses (see Helled et al. 2020, for a review on Uranus and 
Neptune). 

As giant planets dominated by their H/He envelopes are rare, 
we expect a strong correlation between the mass architecture (i.e. 
the arrangement and distribution of planetary masses) and the 
core-mass architecture (i.e. the arrangement and distribution of 
core-masses) to exist also at the system level. In Fig. 10, we show 
the coefficient of similarity and the coefficient of variation of 
planetary mass as a function of the coefficient of similarity and 
the coefficient of variation of core mass. The colour indicates the 
total mass of envelope accreted by all planets in a system. 

Comparing the coefficient of similarity for planetary masses 
and core masses (Fig. 10, left panel), we observe that a large frac- 


tion of systems (> 90%) follow the y=x line. This implies that for 
most planetary systems, the arrangement and distribution of core 
masses is imprinted on the mass architecture of the system. Sys- 
tems which show large deviations from the y=x line have gener- 
ally accreted a large amount of gaseous envelope. This suggests 
that the formation of one or more giant planet is partly responsi- 
ble for the deviations. We also observe another important feature. 
Planetary systems that are ordered in mass are also often ordered 
in their core-masses. Conversely, mass anti-ordered systems tend 
to be anti-ordered in their core masses as well. In addition, or- 
dered systems are either on or above the y=x line, whereas anti- 
ordered systems are either on or below this line. This suggests 
that the accretion of gases generally accentuates the underlying 
core mass architecture. 


Considering the coefficient of variation for masses and core 
masses (Fig. 10, middle), we see that most of the planetary sys- 
tems lie either on or above the y=x line. The Cy value measures 
the amount of variation in a set of numbers. This suggests that 
the variation in total masses, for most systems, is either similar 
or larger than the variation in the core masses. This is under- 
standable, since the amount of gas accreted by a planet shows 
a strong correlation with the mass of the planet's core. How- 
ever, there are a handful of systems where the variation in total 
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mass is less than the variation in core masses. Systems that are 
similar in the mass architecture are strongly clustered over the 
y=x line. This stems from the low amount of gas (0 — 20Mẹ) 
accreted by planets in these systems. Figure 10 (middle) shows 
that mixed class systems, as opposed to similar systems, form 
a separate cluster. Physically, this difference is arising from the 
larger amount of gas (50 — 5 000Mgs) accreted by planets in these 
systems. 

Here, the question arises as to whether the strong correlation 
between mass architecture and core-mass architecture is observ- 
able. In Fig. 10 (right), we show Cs(M) as function of Cs (Meore) 
for the three synthetically observed catalogues. All three cata- 
logues probe the inner regions of a planetary system. The figure 
shows that the correlation between mass architecture and core 
mass architecture is strong in all three catalogues. This suggests 
that the observed mass architecture of a planetary system can be 
used to study the underlying core-mass architecture of the sys- 
tem. This is potentially useful to distinguish among competing 
models of planet formation. 


5.3.1. Role of embryo starting location 


We have seen that the core mass architecture of a system strongly 
governs the overall architecture of the system. The arrangement 
of planets in a system also reflects the final distances of these 
planets. It is, therefore, instructive to understand some key as- 
pects which shape these two important properties. The core mass 
and the final distance of a planet are strongly influenced by, 
among other effects, the distance at which an embryo starts in 
our simulations. Figure 11 shows the core mass (left) and the 
final distance (right) as a function of the starting distance. In 
the Bern model, lunar mass (0.01Mg) protoplanetary embryos 
are initialised with a random starting location between the inner 
edge of the disk and 40 au. We also recall that failed embryos 
(objects with a total masses < 0.1Mg) are removed from our 
analysis. 

Emsenhuber et al. (2021a); Burn et al. (2021) analysed the 
nature of planetary migration using migration maps. Both stud- 
ies show the existence of so-called convergence zones. Within 
these zones, planets can migrate outwards. However, outside this 
zone inward migration is prevalent. The existence of such con- 
vergence zones suggests that there ought to be regions of planet 
over-densities; this are essentially regions where planets are ra- 
dially ‘stuck.’ These studies attribute the presence of these zones 
to dust opacity transitions and disc structures, finding that these 
zones evolve with the disc. For a 0.01 Me disc, around a solar 
mass star at 1 Myr, these zones are: (a) for low-intermediate mass 
planets (€ 1 M3) extending from disk inner edge to about lau and 
(b) for intermediate mass planets (1 — 10Mg) around 2-3 au. 

Figure 11 (left) shows that even for embryos that start at 
the same initial distance, the mass accreted by a planetary core 
can differ by two to three orders of magnitude. These differ- 
ences arise from (a) varying solid disc masses; (b) competition 
for accretion in the feeding zone (Alibert et al. 2013); (c) dy- 
namical state of solids in the disc resulting from planetesimal- 
planetesimal, planetesimal-protoplanet, planetesimal-gas disc 
interactions, and so on. Nevertheless, the starting distance seems 
to play a significant role in this scenario. The ice line seems to 
divide the parameter space into two regions: fewer planets inside 
the ice line have low mass cores (g 1Ma), while many planets 
outside the ice line have low-mass cores. 

Inside the ice line, most planets have cores of 1—10Mg. Plan- 
ets that start very close to the star (g 0.1au) are unable to accrete 
a lot of material owing to their small Hill spheres. This explains 
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their small cores masses. Inside the ice line, planets belonging to 
systems of mixed, anti-ordered, and ordered architecture tend to 
have more massive cores than planets belonging to similar sys- 
tems. Around the ice line, planets show a large variety of core 
masses ranging from 0.1 Me to 100Mg. Outside the ice line we 
see the same trend as before: planets that are in similar systems, 
for the same starting location, usually have less massive cores 
than planets which belong to systems of other architectures. 

The final distance of a planet depends on several factors such 
as: (a) migration type (type I or type ID, (b) planet's mass, (c) 
local disc properties, and (d) multi-body effects such as N-body 
scattering. The joint distribution of a planet's final and starting 
locations shows an intriguing trend. Generally, for many plan- 
ets, the final distance strongly correlates with their starting loca- 
tion. Orbital migration allows planets to move (mostly) inwards 
— positioning many planets below the y=x line. N-body effects 
(such as planet-planet scattering or outward migration) may scat- 
ter some planets further away from their host star. These planets 
are located above the y=x line. Curiously, many planets which 
end up farther away than their starting location were initialised 
around the ice line and are mostly low massive (g 20Me). We 
attribute this over-density to the outward migration convergence 
zone around the ice line discussed above. 

Another important finding is that planets inside the ice line 
in similar systems probably formed in situ. Fig. 11 (right) shows 
that most planets, inside the ice line, which did not migrate in- 
wards are part of similar architecture systems. Conversely, most 
of the planets which have migrated inwards seem to belong to 
systems that have mixed, anti-ordered, and ordered architectures. 
Outside the ice line, many planets have migrated inwards. Most 
planets starting around 20 au (or more) accrete little mass in their 
cores and show little radial displacement (Hansen & Murray 
2012; Chiang & Laughlin 2013). The properties of these em- 
bryos may draw some influence from our modelling choice as 
well. The N-body integrator in this model is used for 20 Myr. 
Longer integration times may allow some embryos to have more 
massive cores via giant impacts. 


5.4. Core water-ice mass fraction architecture 


Our model calculates the internal structure of a planetary core 
(for details see Emsenhuber et al. 2021a; Mordasini et al. 20122). 
We solved 1D differential equations demanding mass conser- 
vation and hydrostatic equilibrium, with a modified polytrope 
equation serving as the equation of state (Seager et al. 2007). 
The chemical composition of each planetary core is also tracked. 
This is accomplished by tracking the chemical makeup of the 
accreted planetesimals and other colliding planets. The underly- 
ing chemical models have thirty-two refractory and eight volatile 
species (Thiabaud et al. 2014; Marboeuf et al. 2014a,b). These 
different chemical species are grouped into three different ma- 
terials which make the planet's core, in our model: (a) iron, (b) 
silicates, and (c) ice. All refractory species (except iron) make 
up the silicate mantle and all volatile species contribute to ice. 
Since H2O constitutes 60% of all ice by mass, we label this 
latter component as water ice. The water mass fraction (fice) of 
each planetary core is computed. 

We assume that inside the H20 ice line, only refractory ele- 
ments contribute to the solid phase of a planetesimal. Outside 
this evolving ice line, due to their condensation, volatile ele- 
ments also contribute to the solid phase of a planetesimal. Fig- 
ure 12 (left) shows the water mass fraction of a planet's core as 
a function of its initial location. Most planets which start inside 
the ice line have little to no volatiles in their cores. A jump in 
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Fig. 12. Planetary core water-ice mass fraction. Left: Core water mass fraction of a planet as a function of its starting location. The architecture 
of the system to which a planet belongs to is shown by marker characteristics. The vertical shaded regions shows the location of the ice line. 
Middle: Water mass fraction architecture seen through the coefficient of similarity versus the coefficient of variation plot. The shape of the marker 
shows a system's mass architecture, and the colour depicts its Aryabhata's number (see Paper II for definition). Right: Distribution of fice across 
architecture classes. Depending on fice, planets are labelled as ‘dry’, ‘moist’, or ‘wet’. 


fice is seen around the ice line. Outside the ice line, most plan- 
ets have at-least 40% fice in their cores. This suggests that the 
history of formation and evolution of a planet is imprinted on its 
water mass fraction. 


We are interested in studying the ice mass fraction architec- 
ture of a planetary system. However, we cannot directly apply 
our framework (Eqs. 1, 2) because the water mass fraction is 
a quantity that admits O as a value. While this can lead to ill- 
defined numbers, this issue has a simple remedy. For quantities 
that can be 0, we propose the following modification to Eq. 1: 


Cs(q) = lim (4) 


1 i=n-1 
sc P3 [vo 


qQi«1 t€ 
di + € ` 


Numerically, we calculated the coefficient of similarity with 
e = 1079, We verified this step by calculating the coefficient 
of similarity for quantities which do not admit zero (such as 
masses). In a bootstrapped numerical experiment of 10,000 tri- 
als, the coefficient of similarity for mass was calculated using 
both equations (1 and 4). The relative difference between the 
two outcomes ranged between 107"? to 1079. 


The ice mass fraction architecture of Bern Model systems is 
shown in Fig. 12 (middle). A prominent feature from this fig- 
ure is that most systems have Cs (fice) either close to 0 or posi- 
tive. A system with Cs (fice) ~ 0 and low Cy(fice) will be com- 
posed of planets whose core water mass fraction is similar to 
one another. A system with positive Cs (fice) will be composed 
of planets whose core water mass fraction increases inside out. 
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Figure 11 (right) tells us that many planets that started outside 
the ice line, and are water rich have not suffered any major ra- 
dial displacement. Thus, a positive Cs (fice) should be a default 
scenario for most planetary systems. About 74% systems in the 
Bern model have Cs(fice) > 0.1. Almost 97% of systems have 
Cs (fice) > 0. We propose the ‘Aryabhata formation scenario’ to 
explain the *non-default' systems. This scenario and the related 
quantity ‘Aryabhata’s Number’ are described in Paper II. 


5.5. Frequency of dry, moist, and wet planets 


We are interested in exploring the link between the water mass 
fraction architecture and the mass architecture of a system. To 
this end, we divide planets into three categories based on their 
water mass fraction. A planet is called ‘dry’ if fice < 1%, ‘moist’ 
if fice € (1,40]%, and ‘wet’ if fice > 40%. These labels serve to 
simplify our analysis and allows us to see general trends between 
system architecture and planetary composition. The distribution 
of water mass fraction across systems of different architecture 
classes is shown in Fig. 12 (right). While all three planet classes 
are present in all four architecture classes, there are some ob- 
servable trends. 

Figure 12 (right) shows that similar architectures host many 
of the dry planets produced in the Bern model and anti-ordered 
architectures are mostly composed of wet planets. This tells us 
that many of the planets that start inside the ice line become part 
of similar architecture systems. Conversely, systems with anti- 
ordered architecture are mostly composed of planets that started 
outside the ice line. Mixed architecture systems are generally 
composed of more planets that started outside the ice line than 
inside, as compared to similar architecture systems. Moist plan- 
ets are present in all architecture classes. We quantify the fre- 
quency of dry, moist, and wet planets as a function of mass archi- 
tecture class (similar, mixed, ordered, or anti-ordered), metallic- 
ity (low or high), and source catalogues (Bern model, Bern Com- 
pact Multis, Bern KOBE Multis, and Bern RV Multis). Figure 13 
shows the planets per star (i.e. the number of each planet type di- 
vided by the number of stars) across these forty sub-categories. 

Overall, compared to synthetically observed catalogues, 
Bern model simulations demonstrate more wet planets. This 
is understandable since we are looking at the entire underly- 
ing population, which includes planets from the outer parts of 
these systems. Likewise, synthetically observed catalogues tend 
to have more dry planets. Systems around low-metallicity stars 
(regardless of the catalogue) generally tend to have a higher 
frequency of dry planets as opposed to systems around high- 
metallicity stars. The frequency of wet planets shows a notice- 
able increase for systems around high-metallicity stars. Amongst 
the different catalogues, Bern Compact Multis have the highest 
frequency of dry planets, followed by Bern KOBE Multis, and 
Bern RV Multis. Low-metallicity environments have a slightly 
higher average planet per star (8835/541 ~ 16.3) than high- 
metallicity environments (6722/455 ~ 14.8). 

Similar systems: Systems in the underlying Bern model that 
are characterised by a similar architecture tend to have many wet 
planets (~ 10 per star) and few dry or moist planets (~ 3 — 4 
per star). However, synthetically observed catalogues seem to 
have a bias against the discovery of many wet planets. For the 
similar class of compact multi-planetary systems, dry planets 
are more common around a low-metallicity star. However, for 
a high-metallicity star, the frequency of dry and wet planets is 
roughly the same. For transiting systems, in the Bern KOBE 
Multis, low-metallicity environments favour more dry planets 
and equal proportions of wet and moist planets. Conversely, 
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in high-metallicity environments, wet planets occur more fre- 
quently than dry or moist planets. For RV systems, the fre- 
quency of each planet class is approximately the same in a low- 
metallicity environment. High-metallicity environments almost 
double the frequency of wet planets. The average planet per star 
is similar around both low metallic (~ 16.8) and high metallic 
environments (z 17.3). 

Mixed systems: Mixed class systems generally have many 
wet planets. It is only for compact systems around high- 
metallicity stars, the frequency of dry planets is higher than wet 
planets. In all other cases, the frequency of wet planets is greater 
than the frequency for dry or moist planets. The average planet 
per star is similar around both low-metallicity (~ 15.2) and high 
metallicity environments (~ 15.3). 

Anti-Ordered systems: Systems with anti-ordered architec- 
ture have a distinct core water mass fraction architecture. These 
systems are rich in wet planets. In fact, about 80% of these sys- 
tems follow the Aryabhata formation scenario described in Pa- 
per II. Compact anti-ordered systems may have some dry plan- 
ets. For transit and RV surveys, the frequency of dry planets is 
zero in our simulations. The total number of planets per star 
in anti-ordered systems is slightly higher around low metallic- 
ity stars (159/19 ~ 8.4), as compared to high metallicity stars 
(504/65 ~ 7.8). In the future, if an anti-ordered architecture 
planetary system is to be discovered, it would be interesting to 
study its core water mass fraction architecture as well. The cur- 
rent work suggests that the Aryabhata's number for these sys- 
tems should be close to 0 and, irrespective of the detection tech- 
nique, the system should would be expected to have many wet 
planets (see Paper ID); this is one of the main predictions arising 
from this work. 

Ordered systems: Juxtaposed directly to the anti-ordered sys- 
tems, ordered systems in synthetically observed catalogues tend 
to be rich in dry planets. These systems are distinct not only 
because of their frequent dry planets, but also due to a low fre- 
quency of wet planets. For all synthetic catalogues, moist plan- 
ets occur more frequency than wet planets, which is a unique 
distinguishing feature for these systems. For the Bern model, 
these systems have low average planets per star: 5 around low- 
metallicity stars and 3.1 around high-metallicity stars. 

In summary, we note some salient features of these sys- 
tem architectures. Generally, wet planets survive more fre- 
quently around high-metallicity stars. One detection technique 
that favours the discovery of close-in planets also favours the de- 
tection of dry planets. The comparative frequency of planet (dry, 
wet, or moist) per star seems to be intimately connected with 
the mass architecture of the system. Similar and mixed systems 
can host lots of dry or wet planets, depending on the metallicity 
of the systems and detection technique. Anti-ordered systems, 
forming prominently via the Aryabhata formation scenario, are 
rich in wet planets. Ordered systems, in simulated observations, 
are rich in dry planets and have more moist planets than wet 
planets. The physical connection between the average planet per 
star and the star's metallicity is sensitive to the formation path- 
ways that a system undergoes. 


6. Habitability as a function of system architecture 


In this paper thus far, we have described a new framework for 
studying the architecture of planetary systems (Sect. 3), the char- 
acteristics of the four classes of system architecture (Sect. 4), 
and the relation between the mass architecture of a system and 
its internal structure and composition architecture (Sect. 5). In 
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Fig. 13. Frequency of planets. This diagram shows the average planet per star for dry, wet, and moist planets in several catalogues (rows), across 
several architecture classes (columns), and around low (left) and high (right) metallicity stars. The planet per star is simply the total number of 
planets divided by the total number of stars, after appropriate filters for metallicity, catalogue, or architecture. 


this section, we speculate on the idea of studying habitability as 
a function of system-level architecture. 


Mankind has pondered the existence of other biotic life- 
forms beyond Earth, as well as outside our own Solar System. 
Our current understanding of habitability stems from and is fo- 
cused at an individual planetary level. We consider whether hab- 
itability could be correlated with other properties of a plane- 
tary system, namely, whether habitability could be a system- 
level phenomenon. In this section, we speculate on the role of 
planetary-system level information on the existence of habit- 
able worlds in such systems. The framework we present here 
for studying the system-level architecture of a planetary sys- 
tem brings to light several novel questions, probing the depen- 
dence of habitability and occurrence of habitable worlds (and 
related concepts) on the architecture of a said system. For ex- 
ample, we wonder how the occurrence rate of habitable planets 
in the galaxy depends on the occurrence of the four architecture 
classes. 


In this section, we address this question on three levels: sys- 
tem, planet, and planet ratio. We use the concept of empirical 
Habitable zone (EHZ) planets from Quanz et al. (2022); Koppa- 
rapu et al. (2014). Planets with masses between [0.1, 5] Ma and 
stellar insolation within [1.776, 0.32]S 4 are considered to be in- 
side the EHZ. The stellar flux limits correspond to ‘recent Venus’ 
and ‘early Mars’ scenarios and include the luminosity evolution 
for a 1Me Solar-twin. At the system level, we note the frequency 
of systems of a particular architecture to host at least one planet 
in the EHZ. At the planet level, we count the frequency of planets 
in the EHZ across each system architecture class. At the planet 
ratio level, we show the fraction of all EHZ planets across their 
architecture class. Figure 14 shows the frequency of EHZ plan- 
ets, at all three levels, as a function of their system architecture 
for both synthetic and observed exoplanetary systems. 

Out of all synthetic systems with a similar class architec- 
ture, ~ 77% host at least one EHZ planet. This is remarkably 
higher than any other architecture class. ~ 10% of systems with 
mixed architecture host at least one EHZ planet. The frequency 
drops to « 1% for anti-ordered architecture systems and « 0% 


for ordered systems. One way to interpret these numbers could 
be to look at the multiplicity distribution across each architecture 
class in Fig. 7. The frequency of at least one EHZ planets across 
architecture class seems to follow the multiplicity trends. Sim- 
ilar and mixed architectures have comparably high number of 
planets. The distribution of the Aryabhata's number shows that 
similar systems usually have higher Aryabhata's number than 
mixed systems, implying that similar systems tend to host more 
planets which started from inside the ice line (see Paper II for 
Aryabhata's number). This may account for the large frequency 
of similar systems which host at least one EHZ planet. The mul- 
tiplicity distribution shows that anti-ordered systems often host 
less planets than similar and mixed class systems, while ordered 
systems have the lowest multiplicities. We see in Sect. 4.2 that 
the similar class architecture is perhaps the most common archi- 
tecture for planetary systems in our galaxy. These results from 
the Bern model simulations suggest that observation campaigns 
to detect habitable planets will find more EHZ planets in similar 
class architectures. 


For the observed multi-planetary systems in our catalogue, 
about ~ 13% of similar class systems have at least one EHZ 
planet. About 796 of ordered class exoplanetary systems in our 
catalogue host at least one EHZ planet. In the mixed class ob- 
served systems in our catalogue, none of them have EHZ planets 
and there are no known anti-ordered class systems in our cata- 
logue. These frequencies are quite different from their theoretical 
counterparts. While the lack of a complete and reliable obser- 
vations catalogue may explain the discrepancy for similar class 
systems — it does not completely explain the discrepancy for or- 
dered systems. Our own planet resides in the ordered class sys- 
tem of the Solar System, which is not supposed to be influenced 
by issues such as completeness or detection biases. This reflects 
the inability of Bern models to simulate a Solar System analogue 
— pointing to a gap in our understanding of the physics that goes 
into planetary formation and evolution. In addition, many ob- 
served ordered class systems may have a different architecture 
when more planets in these systems are detected. 
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Fig. 14. Planets inside the empirical habitable zone (EHZ). The left-most plot shows the frequency of planetary systems, of a given architecture 
class, which host at least one planet inside the EHZ. The central-left plot shows the fraction of planets inside a given architecture class which are 
in the EHZ. The central-right plot shows the fraction of all EHZ planets within a given architecture class. The rightmost plot shows the distribution 
of fice for EHZ planets across the architecture classes. The cartoon sketch of Earth emphasises that the only known life-harbouring planet resides 
in an ordered system. The length of error bars visualises the total number of systems or planets in respective bin as: 100/ Vbin counts. The lengths 
of the error bars represents the number of planetary systems (left-most panel) and the number of planets (two middle panels) which are inside 
the bin. Large error bars in the leftmost panel, for example for anti-ordered architecture emerges from their low count (see Fig. 5). The Gaussian 


kernel is estimated using Scott's rule (Scott 2015) 


At the planet level in our simulations, out of all synthetic 
planets that exist in similar class systems, about 1096 are inside 
the EHZ. This frequency is, again, remarkably higher for any 
other architecture class. About 1% of all simulated planets in a 
mixed system are inside the EHZ. Close to 0% of all planets in 
anti-ordered and ordered class architectures are inside the EHZ. 
From our observational catalogue, while 5% of observed exo- 
planets in similar class systems are inside the EHZ. About 3% 
of observed exoplanets in ordered class systems are inside the 
EHZ. 


The planet ratio level shows the fraction of all EHZ planet 
that belong to a particular architecture class. In the Bern model, 
we see that out of all EHZ planets, about 99% are in the simi- 
lar class. The share of EHZ planets by other architecture classes 
is negligible. Amongst the observations, three-quarters of EHZ 
planets are in similar class and the remaining are in ordered 
class. The observations and theory are quite misaligned in this 
scenario. We attribute this discrepancy to the absence of a com- 
plete and reliable catalogue of observations. 


Our observations catalogue has only 41 multi-planetary sys- 
tems, of which only four host planets inside the EHZ. These 
systems are Trappist-1 (three planets in EHZ), GJ 667 C (two 
planets in EHZ), Solar System (two planets in EHZ), and Tau 
Ceti (one planet in EHZ). The occurrence of architecture classes 
and the frequency with which they host EHZ planets might be 
better constrained with future observations. This may allow us 
to have a better estimate of the occurrence rate of EHZ planets 
as a function of architecture class. 


Simulations suggest that ordered architecture is a rare out- 
come of planet formation (about 1.5% of systems out of 1000 
were deemed to be ordered) and yet, we live in an ordered sys- 
tem. These two statements can shed new light on the rarity of life 
in the galaxy. We foresee that the famous Drake equation may be 
suitably modified to take into account the occurrence rate of dif- 
ferent architectures and thereby set more optimal constraints on 
Ne (Sarkar 2022). 


Since water plays a fundamental role for life forms on Earth, 
it is interesting to probe the core water-ice fraction for the EHZ 
planets. Figure 14 also shows the fice distribution for EHZ plan- 
ets in the Bern model. As we see before, most of the EHZ plan- 
ets are in the similar class and ~ 1% of EHZ planets are in the 
mixed class. EHZ planets in similar systems are ‘dry’, ’ ‘moist’, 
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and ‘wet’. In stark contrast, EHZ planets in mixed class are only 
‘wet’ planets. We hope these results may be useful in guiding 
future missions in finding EHZ planets that have the potential to 
harbour life. 


7. Summary, conclusions, and future work 


In this paper, we introduce and explore a new framework for 
studying the architecture of planetary systems. Our new frame- 
work allows us to study, quantify, classify, the global architecture 
of an entire planetary system at the system-level; and compare 
the architecture of one planetary system with another. In Sect. 3, 
we detailed the new architecture framework and presented an in- 
depth discussion comparing our framework with other works in 
the literature. We present the coefficient of similarity and the co- 
efficient of variation as two quantities that quantify our concep- 
tual ideas. Our framework gives rise to a new parameter space 
(the Cs versus Cy plane) in which individual planetary systems 
can be compared with one another. Throughout this paper, we 
applied this framework to study the distribution and arrange- 
ment of several planetary quantities within a planetary system, 
thereby understanding the system architecture for that quantity. 
In this manner, we studied the mass architecture, the radius ar- 
chitecture, the core mass architecture, the core water mass frac- 
tion architecture, and the density architecture of synthetic and 
observed planetary systems. 


To study some consequences of this framework, we applied 
our method to several catalogues of planetary systems (intro- 
duced in Sect. 2). We curated, especially for the purposes of this 
study, a catalogue of observed multi-planetary systems that have 
four or more planets and include mass measurements for at least 
four planets. For engendering further studies, additional stellar 
and planetary properties were collected and presented in Table 1. 
We also used synthetic planetary systems simulated via the Bern 
model. To facilitate a comparison of theory with observations, 
we prepared three synthetic observed catalogues by applying the 
detection biases on the simulated planetary systems. This led to 
the Bern RV Multis, Bern KOBE Multis, and the Bern Compact 
Multis catalogue. We note that there are caveats present in the 
datasets we used. The model-dependent results we present here 
may be improved upon in future studies using better theoretical 
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models and a more complete observational catalogues (e.g. from 
PLATO). 
Summary of architecture framework: 


1. The architecture framework is model-independent and there- 
fore does not suffer from any caveats emerging from planet 
formation theory or observations. 

2. The same architecture framework can be used to study the 
multi-faceted aspects of planetary system architecture. When 
the framework is applied to study planetary masses, the 
framework informs us of the mass architecture of the sys- 
tem, namely, the arrangement and distribution of masses in 
the planetary system. In this way, we can use this framework 
to study the mass architecture, radii architecture, eccentricity 
architecture, and so on for the same system. In this series of 
work, we identified the architecture of a system with its bulk 
mass architecture. 

3. Planetary system architecture can be one of four classes that 
are derived from our framework: similar, mixed, ordered, and 
anti-ordered. 

4. A planetary system’s architecture is of similar class when the 
masses of all the planets within such a system are similar to 
each other. This architecture class corresponds to the ‘peas 
in a pod’ architecture trend reported in the literature. 

5. The architecture class of a planetary system is ordered (or 
anti-ordered) when the planetary masses in such systems 
tend to increase or decrease from inside-out. 

6. Planetary systems of mixed class architecture host planets 
whose masses show broad increasing and decreasing varia- 
tions. 


Our key model-dependent findings are as follows: 


1. Frequency of architecture class: Systems with similar bulk 
mass architecture are the most common outcome of simula- 
tions, followed by the other three architecture classes. Our 
model suggests that similar architecture should be the most 
common exoplanetary system architecture in our Galaxy and 
beyond. This explains why radius similarity in exoplanets 
was already detected from the first four months of Kepler 
data (Lissauer et al. 2011). 

2. Distance bi-modality: We found hints of a bi-modality in 
the exoplanetary distance distribution arising from the two 
different modes of orbital migrations. This bi-modality is 
readily visible (see Fig. 7) for similar and mixed mass ar- 
chitecture exoplanetary systems observed via RV. 

3. Core mass architectures: We found that for most systems, 
the bulk mass architecture is inherited from the core mass ar- 
chitecture. In addition, the accretion of gases tends to high- 
light the underlying core mass architecture by amplifying it. 
In this way, the observed mass architecture of a system could 
serve as a gateway for studying the distribution and arrange- 
ment of the planetary core masses, which tends to be simpler 
for theoretical modelling. 

4. In situ formation: We found that most planets belonging to 
the similar bulk mass architecture class form in situ inside 
the ice line. In contrast, planets inside the ice line belong- 
ing to mixed, anti-ordered, and ordered show large inward 
migrations. 

5. Core water-ice mass fraction architectures: Synthetic 
planetary systems were found to have two scenarios for their 
core water mass fraction architecture. The default scenario 
consists of relatively more dry planets in the inner parts of 
a system and more wet planets in the outer parts of the sys- 
tem. This is probably a direct consequence of the starting 


location of planets: planets starting inside (or outside) the 
ice line tend to be dry (or wet). About one-fifths of simulates 
systems do not follow the default scenario described above. 
We propose the 'Aryabhata formation scenario’ to explain 
their core-water mass fraction architecture (see Paper II). 

6. Linking architecture and internal composition: We found 
that wet planets are more likely to survive around high- 
metallicity stars. Among other predictions, we showed that 
anti-ordered observed systems should be rich in wet worlds, 
while ordered observed systems are expected to have many 
dry planets (based on the core-accretion planet formation 
paradigm). 

7. Density Architectures: Synthetic systems that are ordered 
(or anti-ordered) in mass tend to also be ordered (or anti- 
ordered) in their bulk densities. Some mass similar systems 
may also have low dispersion in their planetary bulk densi- 
ties. The density architecture is sensitive to the Aryabhata’s 
number (i.e. the starting location of various surviving plan- 
ets; see Paper II). The density architecture of observed sys- 
tems is in good agreement with the density architecture of 
synthetically observed simulated systems. Detection biases 
seem to favour the discovery of planetary systems where the 
densities show anti-ordering, mixing, or similarity. 

8. Radius architectures: The radius architecture of most plan- 
etary systems closely follows their mass architecture. There- 
fore, most mass similar systems also show similarity in ra- 
dius (also for mass mixed, ordered, or anti-ordered systems). 
However, this is not always true. Future studies can calibrate 
a classification scheme based on planetary radii. 

9. Habitability as a system-level phenomenon: We reflected 
on the prospect of studying habitability as a function of 
system-level properties such as system architecture. Similar 
architecture systems represent an excellent observation tar- 
get for finding life outside the solar systems because these 
systems tend to host many more planets inside the empirical 
habitable zone that other architecture classes. 

10. The current version of the Bern model seems to have dif- 
ficulty in producing planets inside the EHZ of an ordered 
architecture system. Nevertheless, more data is required to 
conclude whether the existence of Earth, an inhabited planet 
in an ordered system, is an exception or whether there are 
additional gaps in our understanding of planet formation. 


This paper is the first in a series. The current work presents a 
new testing ground, the architecture space, for theoretical mod- 
els and for comparing observations with theory. We can now 
constrain our understanding of planet formation not only on the 
level of an individual planet — but at the global systemic level. 
This is a multi-faceted approach, since the system architecture 
of several quantities can now be uniformly assessed and com- 
pared with observations. In our next paper (Paper II), we show 
another important aspect emerging from this architecture frame- 
work which asserts that systems with comparable architecture 
often have the same formation pathways. We present ideas to 
further the nature versus nurture debate around planet forma- 
tion. While similar architectures are usually a product of their 
starting conditions, stochastic multi-body effects are responsible 
for shaping the other three architecture classes. This work leads 
to several future studies which will be presented in other papers 
in this series. Davoult et al. (in prep.) explore how the present ar- 
chitecture framework can be employed for an efficient usage of 
telescope time to hunt for habitable worlds. Other possible ex- 
plorations that emerge from this work include: (a) a data-driven 
approach to classifying planetary architecture based on radii and 
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(b) a suitable modification to Drake’s equation that accounts for 
the empirical occurrence rate of system architectures. 
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Appendix A: Bern Model: Additional details 


In this section, we provide some additional details on the physics 
included in the Bern model and how it is utilised to simulate syn- 
thetic planetary systems. Finally, we give an overview on com- 
parisons between the output of the Bern Model and observed 
planetary systems. For the historic development, we refer to Al- 
ibert et al. (2004, 2005); Mordasini et al. (2009); Alibert et al. 
(2011); Mordasini et al. (2012a,b); Alibert et al. (2013); Fortier 
et al. (2013); Marboeuf et al. (2014b); Thiabaud et al. (2014); 
Dittkrist et al. (2014); Jin et al. (2014) and reviews in Benz et al. 
(2014); Mordasini (2018). 

The Bern model is based on the core accretion paradigm 
of planetary formation (Pollack et al. 1996). The model in- 
cludes stellar evolution for a solar-mass star, using evolution 
tracks from Baraffe et al. (2015). The star interacts with the 
protoplanetary disk and influences its thermodynamical proper- 
ties. The protoplanetary disk has two phases: gas and solid. We 
model this disk using the approaches of viscous angular momen- 
tum transport (Lynden-Bell & Pringle 1974; Veras & Armitage 
2004; Hueso & Guillot 2005). Turbulence is characterised by the 
Shakura & Sunyaev (1973) approach, with a = 2 x 10 3. Gas 
from the disk is accreted by planets, host star, and lost via photo- 
evaporation. The 1D geometrically thin disk evolution is studied 
up to 1000 au. The initial mass of this gas disk and its lifetime 
are randomly drawn for each run of the simulation. The solid 
phase of the disk is composed of a swarm of planetesimals. The 
solid disk is modelled as a fluid which evolves via (a) accretion 
by growing planets; (b) interaction with the gaseous disk; (c) dy- 
namical stirring from planets and other planetesimals; and so on 
(Fortier et al. 2013). The initial mass of the solid disk depends 
on the metallicity of the star and also on the condensation state 
of the molecules in the disk (Thiabaud et al. 2014). The host star 
metallicity is randomly drawn for each run of the simulation. 

We added 100 protoplanetary embryos to the protoplanetary 
disk. The initial location of each embryo was varied from one 
simulation to another. It was also ensured that no two embryos 
start within 10 hill radii of each other (Kokubo & Ida 1998, 
2002). Embryos accrete from their feeding zones and any over- 
lap may lead to competition (Alibert et al. 2013). The accretion 
rate depends on the collision probability between a protoplanet 
and a planetesimal, which in turn is influenced by the dynamical 
state of the solid disk. 

Gas accretion occurs in several phases (Mordasini et al. 
2012b). Initially, the gas disk smoothly transitions as a gaseous 
envelope around all planets — the attached phase. For planets that 
are massive enough to undergo runaway gas accretion, the rate 
of gas supply from the disk may not be enough. In these scenar- 
ios, the planet detaches from the gas disk and rapidly contracts 
to Rz. After the gas disk dissipates, all planets are in the isolated 
phase. Gas accretion from the disk is no longer possible and in 
this phase, the planets contract and cool. For all the planets, their 
internal structure is modelled at each time step. We assume plan- 
ets are spherically symmetric and composed of accreted materi- 
als that arranges itself in layers: iron code, silicate mantle, water 
ice, and H/He gaseous envelope (if accreted). 

Next, we use these recipes to simulate several thousands of 
planetary systems in an approach called population synthesis 
(Emsenhuber et al. 2021b). We start 1000 star-disk-embryo sys- 
tems with some fixed as well as some randomly drawn proper- 
ties. The initial properties are inspired by observations of disks 
Tychoniec et al. (2018). The then numerically modelled these 
systems, endowing them with additional physics at the same 
time. Numerically, we incorporated multi-body dynamical in- 
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teractions via N-body simulations. Planet-disk interactions lead- 
ing to orbital migration and eccentricity and inclination damping 
were also incorporated in the N-body Coleman & Nelson (2014); 
Paardekooper et al. (2011); Dittkrist et al. (2014). We followed 
these numerically intensive steps for 20 Myrs and then stopped 
the N-body calculations. The model then continued to evaluate 
the internal structure of all planets in the system for 10 Gyrs. 

The recent version of these simulations has been published 
in the New Generation Planetary Population Synthesis (NGPPS) 
series of papers (Emsenhuber et al. 2021a,b; Schlecker et al. 
2021a,b; Burn et al. 2021; Mishra et al. 2021). The output of 
these models have been compared with observations in several 
works. Drazkowska et al. (2022) compares the occurrence rates 
of synthetic systems with observations. Schlecker et al. (2021a) 
studies the warm Super Earth and cold Jupiter correlation in the 
synthetic systems. Mishra et al. (2021) analyse the ’peas in a 
pod’ architecture and compare synthetic systems with observa- 
tions from Weiss et al. (2018). Mulders et al. (2018) present a 
detailed comparison of the synthetic models with Kepler obser- 
vations. 


Appendix B: Stellar and planetary data references 


1. Sun: Archinal et al. (2018); Standish (1992); Wang et al. 
(2018); Helffrich (2017); Jacobson et al. (2006); Jacobson 
(2014, 2009) 

2. Trappist-1: Agol et al. (2021); Gillon et al. (2017); Burgasser 
& Mamajek (2017); Grimm et al. (2018) 

3. TOI-178: Leleu et al. (2021) 

4. HD 10180: Lovis et al. (2011); Kane & Gelino (2014) 

5. HD 219134: Seager et al. (2021); Bonfanti & Gillon (2020); 
Vogt et al. (2015) 

6. HD 34445: Vogt et al. (2017) 

7. Kepler-11: Berger et al. (2020); Lissauer et al. (2013) 

8. Kepler-20: Fressin et al. (2011); Buchhave et al. (2016) 

9. Kepler-80: MacDonald et al. (2016); Shallue & Vanderburg 

(2018) 

. K2-138: Lopez et al. (2019) 

11. 55 Cnc: Bourrier et al. (2018) 

. GJ 667 C: Anglada-Escudé et al. (2013) 

13. HD 158259: Hara et al. (2020); Gáspár et al. (2016) 

. HD 40307: Díaz et al. (2016); Stassun et al. (2019) 

15. Kepler-102: Berger et al. (2020); Marcy et al. (2014) 

. Kepler-33: Berger et al. (2020); Lissauer et al. (2012); Had- 

den & Lithwick (2017) 

. Kepler-62: Berger et al. (2020); Borucki et al. (2013) 

. HD 20781: Udry et al. (2019) 

. TOI-561: Lacedelli et al. (2021); Weiss et al. (2021) 

. DMPP-1: Staab et al. (2020) 

. GJ 3293: Astudillo-Defru et al. (2017) 

. GJ 676 A: Sahlmann et al. (2016); Stassun et al. (2017) 

. GJ 876: Trifonov et al. (2018); Rojas-Ayala et al. (2012) 

. HD 141399: Hébrard et al. (2016) 

. HD 160691: Gozdziewski et al. (2007); Pepe et al. (2007) 

. HD 20794: Gozdziewski et al. (2007); Pepe et al. (2007) 

. HD 215152: Gozdziewski et al. (2007); Pepe et al. (2007) 

. HR 8799: Marois et al. (2008); Gravity Collaboration et al. 

(2019); Swastik et al. (2021) 

. K2-266: Rodriguez et al. (2018) 

. K2-285: Rodriguez et al. (2018) 

. Kepler-89: Berger et al. (2020); Weiss et al. (2013) 

. Kepler-106: Berger et al. (2020); Marcy et al. (2014) 

. Kepler-107: Berger et al. (2020); Bonomo et al. (2019) 

. Kepler-223: Berger et al. (2020); Mills et al. (2016) 

. Kepler-411: Berger et al. (2020); Sun et al. (2019) 

. Kepler-48: Berger et al. (2020); Marcy et al. (2014) 

. Kepler-65: Berger et al. (2020); Mills et al. (2019) 

. Kepler-79: Berger et al. (2020); Yoffe et al. (2021) 

. WASP-47: Vanderburg et al. (2017) 

. tau Cet: Vanderburg et al. (2017) 

. HD 164922: Benatti et al. (2020); Rosenthal et al. (20212) 
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Fig. C.1. Maximum value of the coefficient of similarity (blue) and the 
theoretical maximum value of the coefficient of variation (orange) is 
plotted against the maximum tolerance, t. 


Appendix C: Derivation of limits 


We consider a set Q of quantities q, namely, Q = {qi} where 
qi could be the mass, radius or other parameter of a planet, and 
the index, i € [1,7], identifies a planet (with 1 being the inner- 
most planet). We assume that all g; 2 0. The quantities q; are 
expressed as: 


-q (051). (C.1) 


The quantities, g;, are decomposed around some value q’ such 
that all t; are minimised; t; is a measurement of the fractional 
difference (or tolerance) between q’ and q;. Since all individual 
tolerances are a positive quantity, they will satisfy the following 
relation: 


O<t<t. (C.2) 
Appendix C.1: Mean 
Let us consider the mean of the quantities, g;: 
zr Q = 2.idi 
qim = 
n n 
E 
= —(ntj fot). (C.3) 
n 


The mean takes its maximum value only when all individual f; 
values take their maximum and are added up. This gives: 


max d; = q' (1 4 f). (C.4) 
Similarly, the minimum value of the mean is: 
min d; = q' (1 t). (C.5) 


The extreme value of the mean occurs when all the individual 
quantities are extremised. However, in this scenario, since all 
quantities are equal, the coefficient of variation is identically 0. 
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Appendix C.2: Coefficient of similarity 


We start with the definition of the coefficient of similarity, 


i-n-l 
1 dix 
z= log — |. : 

C@= 7 2, [vo ^ (C.6 
Inserting Eq. C.1, in the definition, we get: 

i-n-l ltt 
Cs(q) = — |. C7 
s@=— 2 [vo peus (C7) 


This formulation shows that the coefficient of similarity depends 

only on the fractional differences (tolerances) between q; values 

— and not on their actual values. This is a desirable property. 
Next, we evaluate the max Cs as, 


i-n-l 
1 lti 
l 
: » [vs LI I 
PH los lti 
lti j 


i=1 
1+ t | 


max Cs (q) 


Il 

E 

£g 

X 
— 


= max 
n-— 


p xml 


emm 2, log max 


In the first step, we commuted the max operator with the frac- 
tion (n — 1)7! because we are interested in the maximum for a 
constant n. Next, knowing that the maximum of a sum occurs at 
the sum of maximum summands and that /og is a monotonically 
increasing function, we further commute the max operator. 

We observe the following: 


[ae (C.8) 


max oval when { 2 E: E \ . (C.9) 
This implies that 

max Cs(q) = log aes 

min Cs(q) = -max Cs = log IT (C.10) 


where the second equality can be similarly derived. Fig. C.1 
shows the variation of max Cs as a function of tolerance t. We 
note that the limits of the coefficient of similarity do not depend 
on n, and we verified our results with numerical simulations. M 


Appendix C.3: Coefficient of Variation 
We start with the definition of the coefficient of variation, 


Gg SS 
q 


(C.11) 
and we note that the minimum value of the coefficient of vari- 
ation is zero and it occurs when all q; values are equal, thereby 
giving no variance. 

In the literature, we can find some derivations for the max- 
imum value of the coefficient of variation (Katsnelson & Kotz 
1957; Sharma et al. 2010). Katsnelson & Kotz (1957) show that 
the upper limit of the coefficient of variation is Vr — 1 when all 
but one q; is zero. However, this limit is only a particular case 
of our formulation (specifically, qı = q’ and giz: = 0). Here, we 
derive the limits for a more general scenario. 
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We consider that: 


DN qi Y 
de Pen 


i=1 


(C.12) 


=A 
Here, we have squared the definition of Cy and used the def- 
inition of the standard deviation o(g). As an aside, we note 
that the equation above shows that the coefficient of variation 
is zero when all q; = q, as noted before. We note that the max- 
imum value of C occurs when the term A (in parenthesis) is 
maximised. Denoting >);-; g; by Q, we consider the term in the 
parenthesis, 


A21- Q7" 


Q Q 


The condition for the general maxima of the coefficient of 
variation, in our formulation, is when one of the quantity (say qı 
takes the largest possible value, while all others take the smallest 
possible value): 

qi-q (+t) 
disi =g (1-2). 
The mean in this scenario becomes (marked with ”): 
EL q(1*t--(n—-1)xq(1- 1) 
q = 
n 


(C.13) 


(C.14) 


= z [na =; +24). (C.15) 


The variance in this scenario becomes (marked with "): 


1 2 2 
o" (q) = Hga +)-a'| + @-v[gd-9-7"] H 


agn) = cH 
(C.16) 
This gives: 
2q't 
c" (q) = 1 )vn T]. (C.17) 


Finally, the general expression for the maximum value of the 
coefficient of variation becomes: 


G"(qQ | 2t Nn-l 
d"  n(1-0D42t 


This expression recovers the particular case derived in lit- 
erature when we set ¢ = 1. From this expression, we note that 
the upper limit of the coefficient of variation does not depend on 
the actual values of the quantities, but it depends on the number 
of quantities in the set, Q, and the maximum tolerance, t. This 
new formulation allows us to extract the upper limit of the co- 
efficient of variation for any set whose maximum tolerance, f, is 
known. Interestingly, the above expression gives appropriate re- 
sult when absurd inputs are considered. For example, when there 


are no planets in a system, max C vio = \-1, and when there 


max Cy(q) = (C.18) 


is only one planet in a system, max C vla = 0. For a system of 
two planets, the upper limit is exactly the fractional difference 
(or tolerance), that is, max Cy|,-» =t. 

Furthermore, varying over n, and assuming f € [0, 1), allows 
us to derive the theoretical maximum possible value for the co- 


efficient of variation. This occurs at n = ;5 and gives: 


(C.19) 


Figure C.1 shows the variation of the theoretical max, Cy, as a 
function of tolerance f. 
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Appendix D: Classification boundaries for 
architectures classes 


In this section, we present some considerations that motivate 
the boundaries between the four architecture classes for plan- 
etary masses. In the current formulation (Eq. 3), there are two 
boundaries that need to be identified. We deal with the distinc- 
tion between similar and mixed class first, and then distinguish 
ordered/anti-ordered architecture classes. 


Appendix D.1: Similar versus mixed 


We saw in Sect. 3.2, it is difficult to distinguish between mixed 
and similar architecture classes using the coefficient of similar- 
ity alone. Mixed systems are characterised by large increasing 
or decreasing variations, which may cancel each other out and 
lead to small values of C; (M). Nevertheless, the coefficient of 
variation can distinguish between large variations in values. Fig- 
ure D.1 shows the Cy(M) as a function of the number of planets 
in a planetary system. The left panel shows all synthetic sys- 
tems from the Bern model, while we only show systems with 
ICs (M)| € 0.2 in the right panel. 

Clearly, there are two clusters of planetary systems. The clus- 
ter on the lower right-hand side corresponds to similar class sys- 
tems. Mixed systems, having large values of Cy(M), are spread 
over the top left region. It is clear that the boundary between 


similar and mixed classes depends on the number of planets. The 


black line (corresponding to y = Vect) neatly separates the two 


clusters. We have chosen this equation to disentangle similar ar- 
chitectures from the mixed class. This equation has, incidentally, 
two key properties: 1) it ensures that no two planet system can 
be of mixed architecture and 2) it happens to be exactly half of 
the maximum possible value of the coefficient of variation. 


Appendix D.2: Ordered and anti-ordered 


Having motivated the boundary between similar and mixed 
class, we are now left with three groupings of architecture 
classes. These three groupings correspond to Cs (M) << 0 (anti- 
ordered), Cs(M) ~ O (similar/mixed), and Cs(M) >> 0 (or- 
dered). This suggests that we require two boundaries to distin- 
guish these three groups. However, we posit that the boundaries 
between ordered and anti-ordered should be symmetric around 
0. Thus, we are left with only one boundary. 

ordered (or anti-ordered) systems differ in their architec- 
ture from similar/mixed classes in that the quantity (mass here) 
continues to show an increasing (or decreasing) trend with dis- 
tance. For all planetary systems in the Bern model, we measure 
the Spearman correlation coefficient, R, between the planetary 
masses and their distance from the host star. The Spearman R, 
measuring the monotonicity between two datasets, varies from 
-] to +1, with 0 indicating no correlation. A positive correlation 
implies that as x increases, so does y. Negative correlations im- 
ply that as x increases, y decreases. 

Figure D.1 shows the Cs(M) of synthetic systems as func- 
tion of their Spearman correlation R (mass and distance). We 
note that there is a large cluster of points towards Cg (M) ~ 0. 
This group corresponds to the similar and mixed architecture 
classes. There are some points to the top right (including those 
with R = +1 — corresponding to planetary systems in which plan- 
etary masses are monotonically increasing with distance). There 
is a scatter of points towards the bottom-left (including some 
systems with R = —1). 
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lassification boundaries for architecture classes. Left: Boundary between similar and mixed class. The panel show the coefficient of 
variation for synthetic planetary systems as a function of the number of planets in a system for systems with |Cs (M)| < 0.2. Two 
clusters are clearly distinguishable, allowing us to fix the boundary between the similar and mixed architecture classes. Right: 
Boundary between ordered and anti-ordered. This plot shows the coefficient of similarity of synthetic planetary systems as a 
function of the Spearman correlation coefficient between the planetary masses and distances of that system. Thick horizontal lines 
correspond to potential boundaries. 


First, we note that the comparison of the coefficient of sim- 
ilarity with Spearman R fulfils some expectation. For example, 
there are no points in bottom-right or top-left sections of this 
plot. Second, our objective is to isolate the central cluster of 
points from all other scattered points. We note that |Cs (M)| = 0.1 
fails as a boundary, since it does not include the full central clus- 
ter. Both |Cs(M)| = 0.2 and 0.3 could succeed. Going beyond, 
a value of 0.3 would add many unnecessary points to the central 
cluster. 

To further motivate our choice of boundary, namely, 
ICs (M)| = 0.2, we show the mass-distance diagram of 12 ran- 
domly selected systems with —0.3 « Cs(M) < —0.2 (out of 19) 
in Fig. D.2. We note that all systems show the qualitative fea- 
tures of an anti-ordered system, namely, massive planets in the 
inner region and small planets in the outer region. Since all of 
these planets have their C; (M) < —0.2, we use |Cs(M)| = 0.2 
as a boundary between ordered, anti-ordered, and similar--mixed 
architecture classes. Future works may explore improvements to 
our selected boundaries using additional ideas from K-means or 
hierarchical clusterings. 
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Fig. D.2. Mass-distance diagram. This plot shows the planetary masses 
as a function of distance for some planetary systems with —0.3 « 
Cs(M) < —0.2. The dashed line connects that planets in the system 
and serves to highlight the arrangement and distribution of masses. The 
size of each circle corresponds to the planet's radius and the colour of 
each planet also shows its core water mass fraction. 
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Fig. E.1. A gallery of planetary system architectures. These plots show the mass-distance diagram for similar (left) and mixed (right) planetary 
systems from the Bern Model. Each circle represents a planet, its size corresponds to the planetary radius, and its colour represents the fraction of 
ice in the planetary core. Each panel shows the Cs (M) as well as the Cy(M) of the system. 
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Fig. E.2. A gallery of planetary system architectures. These plots show the mass-distance diagram for anti-ordered (left) and ordered (right) 
planetary systems from the Bern Model. Each circle represents a planet, its size corresponds to the planetary radius, and its colour represents the 
fraction of ice in the planetary core. Each panel shows the Cs (M) as well as the Cy(M) of the system. 
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